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Basic Fluid-Dynamic Considerations in the 
Drawing of Optical Fibers 


By F. T. GEYLING 


(Manuscript received January 9, 1976) 


The optical fiber drawing process is considered in its totality—from 
source to forming zone to draw-down region and take-up end—as a prob- 
lem in fluid dynamics. Fiber drawing of most glasses is dominated by 
viscous stresses, surface tension effects, and quenching rates. This con- 
trasts with the drawing of textile fibers, where other fluid properties and 
non-Newtonian effects can play important roles. Preliminary time-in- 
variant “‘base flow” models are developed for glass drawing, using the one- 
dimensional, small-slope approximation of extensional flow. First-order 
sensitivities of these base flows to changes in operating conditions are 
examined via a stability analysis. Two important instability mechanisms, 
denoted as the tensile and capillary modes of dynamic fiber response, are 
discussed. Several follow-on objectives arising from this study are described. 


1. INTRODUCTION 


Stringent tolerances set on optical fibers used in communication 
systems have generated a need for understanding the fluid dynamics 
of the fiber drawing process. The responses of this process to various 
disturbances, especially those resulting in perturbations of the fiber 
diameter, are of interest. 

For steady-state drawing, one seeks analytic models that interrelate 
the draw-down ratio, draw force, flow rate, and some characteristic 
temperature. Such models serve two purposes: they predict the effects 
of changes in the operating parameters and can therefore be used in 
the control of industrial drawing processes; they also provide a ‘‘base 
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state’ for the theoretical study of dynamic responses. Such responses 
could be due to a variety of physical disturbances; for example, 
mechanical vibrations, thermal transients, ambient gas flow, and even 
acoustic noise. All of these disturbances may produce variations in the 
diameter of the finished fiber. 

In studying the perturbations of liquid fibers, our philosophy departs 
somewhat from the more traditional one of stability analysis in textile 
engineering. Textile fiber studies, in general, strive to avoid fiber 
rupture and gross distortions of the thread line. (See, for example, 
Refs. 1 through 9.) In cases where continuous drawing of textile fibers 
is impossible, much emphasis is placed on the prediction and control 
of thread length, i.e., the filament lengths attainable between spon- 
taneous ruptures. 

The drawing of optical fibers takes filament continuity for granted. 
However, the ultimate optical application is sensitive to small diame- 
tral perturbations, far from rupture. Moreover, optical fibers are 
usually drawn at higher viscosities and draw forces than textile fibers. 
Starting from a much higher melt temperature, the subsequent viscos- 
ity increase due to quenching of the glass is much more severe than in 
polymers. Also, in its molten state glass is more nearly Newtonian than 
most polymers. 

Our purpose, then, is to model the mechanisms by which perturba- 
tions arise in the glass-drawing process and are frozen into the finished 
fibers. Based on this understanding, we hope to control fiber dimensions 
within the tolerances imposed by optical considerations (e.g. Ref. 10). 

The fluid dynamics of fiber forming involve a source flow, also re- 
ferred to as the forming zone, and a draw-down region. The forming 
zone is usually charactcrized by a rapidly contracting flow issuing 
from the bottom of a preform, or pulled from an orifice at the bottom 
of a crucible (Fig. la). In drawing from preforms, we may distinguish 
between a very sharply contracting configuration (Tig. 1b), commonly 
encountered with laser heating, and a more gradual contraction that 
results from furnace heating (Fig. 1c). We include in Fig. 1d the case 
of an overheated preform, where the forming zone consists essentially 
of a liquid drop from which the filament is drawn. The different 
forming-zone configurations shown in Fig. 1 can imply fundamental] 
differences in the flow field, as far as the steady-state and potential 
instabilities are concerned. For example, there is some evidence! to 
suggest that the overheated preform, Fig. 1d, is capable of self-sus- 
tained oscillations at critical draw speeds, whereas, at noncritical 
speeds, the liquid reservoir in the pendant drop tends to absorb per- 
turbations coming from the take-up end. 

The forming zone makes a continuous transition to the draw-down 
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Fig. 1—Fiber-drawing methods. 


region, where an essential part of the fiber contraction occurs. Typical 
draw-down ratios, defined as terminal velocity/source velocity, in this 
region run between 10 and 100. Most fiber stability studies place major 
emphasis on the draw-down region, because of its physical importance 
and mathematical tractability. However, the role of incipient perturba- 
tions in the forming zone cannot be ignored. The essential need for 
modeling this part of the flow field, if only by numerical simulations, 
is obvious. For completeness, we also recognize that a very small 
amount of fiber deformation occurs beyond the draw-down region; but 
this falls within the visco-elastic rather than the fluid-dynamic regime. 

Much of the empirical evidence in textile and glass-fiber drawing?! 
suggests that fluid dynamics in the draw-down region is governed by 
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the so-called Trouton viscosity, a measure of the ability of fluid 
filaments to sustain tensile stress as a function of elongational strain 
rate. Indeed, if fiber breakage occurs in the draw-down region, it 
exhibits the neck-down and cohesive separation reminiscent of tensile 
test specimens. Much of the early modeling by Ziabicki!?13 and sub- 
sequent efforts by Pearson,*!4 Kasé,®? and others are based on this 
notion, and for this reason we refer to these analyses as “tensile” 
models. 

In parallel, and possibly quite independent of the tensile mechanism, 
local flow conditions may exist where surface-tension-driven phe- 
nomena play a significant role. Such ‘‘capillary”’ responses could occur 
at the hot tip of the forming zone, where the viscosity is still quite low 
and the filament begins to take shape. Capillary models of filament 
response go back to Rayleigh’s classical work,!!° with subsequent 
extensions and experimental corroboration by, for example, Taylor,? 
Tomotika,!":18 and Weber.” 

The present paper serves several purposes: (¢) it generates pre- 
liminary base-flow models for the draw-down region, using the one- 
dimensional, small-slope approximation of elongational flow; (72) it 
makes a preliminary assessment of sensitivities, through differences in 
base flow, to changes in operating conditions, such as take-up speed 
and quenching profile; (777) it presents the tensile and capillary models 
of dynamic fiber response as fundamentally distinct mechanisms; it 
explores their applicability to different parts of the draw process by 
suitable modifications and extensions of existing theories; (7v) it pro- 
jects several follow-on efforts aimed at unified models of the steady 
and perturbed drawing process, viz. more realistic base-flow models, 
including heat transfer and two-dimensionality at the start of the 
draw-down region, and transient-response models, which account for 
these refinements in the base flow together with possible interactions 
between capillary and tensile mechanisms. 


Il. REVIEW OF EARLIER WORK 


In this section, we discuss some of the literature on tensile and 
capillary stability analyses of liquid filaments. Table I relates several 
key publications and identifies their underlying assumptions and 
physical models. 

The first comprehensive study of tensile fiber models was undertaken 
by Ziabicki et al.!~41?13.20 Motivated by the textile engineers’ interest 
in potential instabilities and fracture mechanisms, the authors reviewed 
existing phenomenological evidence on filament ‘‘spinning.’’* They 


*This traditional terminology, which suggests twisting a fibrous material into 
strands, will be avoided henceforth as inappropriate to the drawing of liquid filaments. 


1014. THE BELL SYSTEM TECHNICAL JOURNAL, OCTOBER 1976 


Table 1— Overview of tensile and capillary fiber models 





Tensile models, Viscosity = 3 u, inertia, | Viscosity = 3 u, Viscosity = 3 u, 
physical features surface tension, stability analyses. stability experiments. 
discussed. quenching, finite . 

draw-down. 

Pertinent literature | Refs. 1-4, 12, 13, 20: Refs. 5, 14: dynamic Refs. 6, 7, 21, 22, 25: 
analysis and experi- analysis of attenuat- experiments corrob- 
mental corroboration ing fibers, including orating cyclic and 
of relevant physical the effects of viscos- transient responses of 
factors in base state ity perturbations and tensile fiber models. 
and some dynamic different base states. 
perturbations. 

Capillary models, Newtonian viscosity Inertia and surface Newtonian viscosity, 
physical features and surface tension. tension. inertia, and surface 
investigated. tension. 

Pertinent literature Ref. 16: cylindrical Refs. 16, 19: theory of Refs. 16, 17, 18, 19: 

filaments neglecting inviscid capillary complete theories of 
inertia, fluid cylinders. capillary jets. Modifi- 


cation for quenching 
effects given here. 
Refs. 17, 18: adaptation 
to contracting flows 
and quenching effects. 


Note: Numerous authors have conducted experiments over the years to dem- 
onstrate instabilities of capillary jets and filaments under isothermal, i.e., 
constant viscosity, conditions. 





also cite experimental evidence that when a filament is formed, say 
in the wake of a free-falling viscous drop, either a tensile (‘cohesive’) 
fracture or capillary separation may sever the flow. 

In Ref. 12, a suitable tensile theory is developed for the prediction of 
finite filament lengths, assuming the cross-sectional distribution of 
longitudinal velocities to be uniform and the steady-state flow to con- 
tract with small slope. Reference 20 accomplishes the same for capillary 
effects by an adaptation of Rayleigh’s classical theory (as we have 
done independently for our own purposes). Ziabicki’s subsequent 
publications include a thorough evaluation of experimental results, 
corroborating his predictions of filament lengths.* 

Ziabicki’s comprehensive effort was followed by a series of papers 
by Pearson and others, aimed predominantly at tensile stability models. 
Starting from the simplest possible representation!*—an isothermal 
filament under constant viscous tension—progressively more elaborate 
results were achieved by adding fiber quenching, inertia, surface 
tension, and gravity. A physical interpretation of this work, however, 
is difficult since the explicit features of diametral perturbation profiles 
along the fiber, their time dependence, and their sensitivity to proper- 
ties of the base state seem poorly understood. We will return to this in 
Section VII. Ziabicki’s tensile flow models and the draw-resonances 


* Like other Western readers, the author has been somewhat late in fully recogniz- 
ing the significance of Ziabicki’s work, much of which was initially recorded in Polish 
journals. Note, however, Ref. 4 for a more recent, comprehensive account. 
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predicted by Pearson et al. were corroborated by several experimen- 
ters. We cite the work of Kasé”2 and Donnelly and Weinberger®® as 
examples in this area. 

As mentioned earlier, the study of surface-tension-driven perturba- 
tions contributes another, as yet separate, view of fiber stability which 
goes back to Rayleigh’s classical work as presented in Refs. 15 and 16. 
Rayleigh himself studied several simplified cases: (2) constant viscosity 
plus surface tension (no inertia), (72) inertia plus surface tension (no 
viscosity), and (722) constant viscosity plus inertia plus surface tension. 
Each analysis yields an exact solution of the hydrodynamic perturba- 
tion equations for an incompressible liquid contained by a cylindrical 
boundary with surface tension. Simplifying assumptions are made only 
in solving the characteristic equations. 

Weber” showed that exact solutions of the characteristic equation 
differ little from Rayleigh’s approximation. He also considered the 
effect of aerodynamic drag on the perturbed filament. Tomotika‘?:!8 
extended Rayleigh’s model to allow for an ambient viscous fluid that 
surrounds the filament and is subjected to a steady elongational flow. 

For our own purposes, we need to modify Rayleigh’s and Tomotika’s 
work to reflect not only contraction in the base flow but also the viscos- 
ity buildup due to quenching. These are essential features of such 
“capillary” models of fiber drawing and are therefore listed explicitly 
in Table J. A display of diametral response profiles along the fiber and 
their dependence on wavelength and base-flow properties is given for 
comparison with tensile stability models. 


Il. FUNDAMENTAL EQUATIONS AND THE ASSUMPTION OF 
ONE-DIMENSIONAL FLOW 

Let z 

r 


axial coordinate 
radial coordinate 


v = axial velocity component 


radial velocity component 
fluid density, assumed constant 


I 


surface tension, 


-e Qpoe 
Il 


Newtonian viscosity, a function of temperature. 


In the cylindrical coordinate system (r, z), the Eulerian equations of 
mass and momentum conservation read: 








ve + up + - =0 (1) 
OT; 1 OT rz 1 
plue + uur + vuz| = aR Bee a a2 pT (2) 
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fe) z re) TZ 1 
pLve + uv, + vv,] = a + SL ays, (3) 


or Tr 





where 7;, 72, Trz denote stress components. In all other instances, the 
subscripts 7, 2, ¢ denote partial derivatives. 
The constitutive relations for an incompressible Newtonian fluid are 


ov 


t= —pt Qu oe 
(4) 


U 
™% = —P + 2u- 


ov , ou 
Trz = B (= ale a } 
where p is the pressure. It is one of the dependent variables, along 
with u, v, and the free surface configuration r = a(z, t). Equations (1) 
to (3) have the following boundary conditions: 
Atz=0: 
u(r, 0, t) = vo(7, t) 
u(r, 0, t) = uo(r, £) (5) 
a(0, t) = ao(t). 
At z = L, the take-up position: 


v(r, L,t) = vz(r, t). (6) 


At r = a(z, t): the kinematic condition 


da da 
a aes 7 
ea vie aa (7) 


together with tangential and normal surface-stress conditions, which 
we do not reproduce in detail at this point. (See Appendix A.) Once a 
solution of this boundary value problem has been found, the draw 
force at any cross-section follows from 


P(@, t) = 2are/[1 + (0a/dz)? |? + 27 in r7.dr. (8) 


In particular, AP(t) = P(L, t) — P(O, t) and, to the extent that the 
solution for 7, contains u, p, ¢, the expression for AP depends on these 
fluid properties. 

The complete set of governing equations for the fiber drawing 
process includes an energy equation from which the temperature dis- 
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tribution T(r, z, t) in the draw-down region is determined. This, in 
turn, yields uw as function of position and time. Strictly speaking, 
therefore, the heat-transfer equation is coupled to the fluid-dynamic 
equations. However, since we give it a separate, detailed treatment 
elsewhere, we uncouple it from this preliminary discussion and in- 
troduce yx(r, z), for nonisothermal draw-down, as a function presumed 
obtainable from some heat transfer model. 

The general axisymmetric, free-surface flow problem posed by (1) to 
(7) is a formidable one. However, for the purposes of an engineering 
analysis and to gain some basic insight, much headway can be made 
by taking advantage of the fact that |a.| = O(e)<«<1 and |u/o| 
= O(e) <1 throughout the draw-down region. This “‘small slope” 
assumption expresses the obvious fact that fluid flow in fiber drawing 
is essentially one-dimensional. The consequences of this kinematic 
feature are developed in Appendix A, taking advantage of the fact that 
we are dealing with low-Reynolds-number flow away from regions of 
strong relaxation in velocity profile. Specifically, we find: 


(t) vy = 0, 1.e., “plug” flow throughout the draw-down region. 
(it) Tre ~ €(Tz, Tr, Te)? /A 
(ii2) wu x evr/a 

(tv) tp = 79, uniform over the cross-section 


(v) tz = —a/a-+ 3yv,, over the cross-section, 
where 3 yu is the so-called ‘“Trouton’’ viscosity for extensional 
flow in a liquid filament. 


These features of one-dimensional flow in the draw-down region sug- 
gest that mass and momentum conservation may be expressed con- 
veniently in terms of cross-sectional fluxes and stress integrals. Equa- 
tions of this kind may be obtained formally by integrating over the 
fiber cross-section: eq. (1) for volume conservation and eq. (3) for the 
axial momentum balance. Alternatively, we may derive these equations 
directly by taking a segment, of length dz, from the tapered axisym- 
metric filament as control volume. We obtain for volume conservation 


(av). + (a?), = 0 (9) 
and for axial momentum conservation 
p(a’v?), + p(a’v); — 3(a’uv.), — caz = 0. (10) 


For an eventual comparison with the early work of Pearson et al., 
we also record (10) after the terms in p and o have been dropped. We 
have the simplified momentum equation 


au, = M(t), (11) 
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where M is an arbitrary time function. This equation merely states 
that the draw force is uniform along the fiber and varies only with ¢. 
Iror later reference, the time-independent (i.e., steady-state) versions 
of (9) and (10) yield 
a’*v = const. = Q (12) 
and 
C = 3a*uv, — pQu + as, (13) 


where C is a constant of integration, viz. the cross-sectional draw force 
reduced by the momentum flux. 


IV. STEADY-STATE SOLUTIONS FOR THE DRAWING PROCESS 


In this section, we consider solutions of the time-independent 
equations (12) and (13). These constitute steady-state representa- 
tions of the drawing process which are of interest for two reasons. First, 
they model the steady drawing operation and yield some insight into 
its controlling parameters, i.e., the dependence of draw force and 
draw-down profile on the draw-down ratio, the viscosity profile, fluid 
inertia, and surface tension. Second, they provide reference states on 
which to build dynamic response models for fiber-stability studies. In 
this context, such solutions are often referred to as base flow models. 

It is well-known in fluid-dynamic stability theory that detailed 
features of the underlying base flow can be quite important to the 
predicted dynamic response. Hence, it is necessary that we examine 
several base-flow solutions of the drawing process for the physical 
features they represent. 

Starting from (13), one observes that the first term on the right-hand 
side represents the viscous stress effect, the second fluid inertia, and 
the third a contribution from surface tension. We assume the following 
fiber dimensions and fluid properties at the start of draw-down: 


= 10 cm/s 

a= 107%cm 

vz = 100/s (e.g., Av = 100 cm/s, over Az = 1 cm) 
= 2.5 gm/cm! 
= 200 dyn/cm 

# = 100 poise for soda lime glass 


1000 poise for fused silica. 


Note that the temperature at the interface between forming and draw- 
down zone is very dependent on as yet unknown fluid-dynamic and 
heat-transfer conditions in the forming zone. Therefore, the assumed 
values for u are rather tenuous. 100 poise probably represents a mini- 
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mum for soda lime glass, as might be expected in fiber drawing from 
crucibles. 

Given these data, the order-of-magnitude relations between terms 
in (18) are: 





Viscosity Viscous stress Inertia Surface Tension 
# (poise) — 8ua’v’ (dyn) pQv (dyn) ga (dyn) 
Soda lime glass 100 3 2.5 X 10-? 2 
Fused silica 1000 30 2.5 X 107 2 


Thus, inertia effects amount to barely 1 percent of the viscous term 
under the most favorable circumstances, while surface tension can be 
important when y is near its minimum. However, physical evidence 
suggests rapid quenching of glass fibers in the draw-down region. 
Hence, inertia and surface tension effects soon become negligible as the 
temperature drops by several hundred degrees over the first few milli- 
meters of the draw-down region, causing » to grow by several orders 
of magnitude (see, for example, Ref. 23). 

Disregarding, for a moment, the quenching effect that actually 
occurs in the draw-down region, we briefly consider an isothermal base 
state for two reasons. First, it permits an understanding of secondary 
physical effects such as inertia and surface tension, without being 
obscured by viscosity changes. Second, in later dynamic response 
studies, the isothermal base state serves as a basis of comparison for 
the stabilizing effect of the quenching that does occur. 

Letting the fluid properties in (13) be independent of z, we consider 
flow conditions such that inertia and surface-tension effects can be 
viewed as perturbations in relation to the viscous stress. Eliminating 
the radius a from (12) and (13) and nondimensionalizing according to 


a _) 


V = v/v, C= <2/L, ( ae? 


we find 
Ww’ — DY = —Wev? + Rev’, (14) 


where vp = vatz = 0, L = length of draw-down region, and D = LC/ 
3aqveuo, the nondimensionalized equivalent of C in (13). The inertia 
and the surface-tension terms are characterized by a Reynolds number 
Re = voLp/3puo and a Weber number, We = cL/3avouo. The elementary 
solution for (14) with We = Re = Ois 
Woo) = fine, (15) 

where we have used the boundary conditions 

v=1 at ¢=0 

V=£8 at ¢=1 
and H = v,/vois the so-called ‘‘draw-down ratio.’”’ Note that D = InE. 
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NORMALIZED FIBER RADIUS 


Corrections to this simple base-state solution for small We and Re 
can be found by perturbation methods. A first-order approximation 
for We < D and Re < D is found by substituting (15) into the r.h.s. 
of (14): 





Yo = exp | ny +7 -1]. a6) 
Resubstituting this into the right side of (14) for a second iteration, 
terms such as exp[e!'“], were approximated by power series prior to 
quadrature with respect to ¢. Then, 


1 
Ua) = ettne {2 = Wese | Re We InE — (We+ 3Re) | 


QnH’inE’ 2’ Init 
2Re 4We _ (4We + Re) 
+ Rez | InE ) Tn. } Ink, eae. ; c ; (17) 


where the expression for 3C[La, b, c, d, ¢] is recorded in Appendix B. 
Numerical results from (17) are best presented in terms of the non- 
dimensional radius a/ao = X. According to (12) 

X= w-}, (18) 


This has been plotted in Figs. 2 and 3 for a range of values in We and 


0.300 
0.000 0.100 0.200 0.300 0.400 0.500 0.600 0.700 0.800 0.900 1.000 


NONDIMENSIONAL DISTANCE ¢ 





Fig. 2—Perturbation solutions for base-flow profiles with increasing surface tension ; 
i = 10, Re = 0. 
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NORMALIZED FIBER RADIUS 
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Fig. 3—Perturbation solutions for base-flow profiles with increasing fluid inertia; 
£=10,We =0 


Re. These results show a tendency for filament contraction to be de- 
layed with increasing surface tension and fluid inertia, a familiar phe- 
nomenon from more accurate base-flow models obtained by numerical 
methods.?4 

Now let us return to the more realistic case of variable viscosity. We 
denote it uon(¢), where the second factor represents only the dimension- 
less dependence on ¢, with 7(0) = 1. Then (14) becomes 


nv’ — DY = —Wev? + Rew’. (19) 


Using an inverse approach, we can, for example, assume Re + 0, 
We = D = 0 (where D = InE is no longer true) and VW = e!!#, as in 
(15). This yields 

Re 

— —— oyflnE 

InE 7 (20) 
and constitutes an inertia-dominated base flow. Experimental data 
suggest that something like an exponential viscosity buildup along the 
draw-down region is a fair representation of quenching effects. Note 
that for this base-flow model, 


X == eo (InEl2)y (21) 


which has the disadvantage that X — 0 for ¢§ > 1. To provide a finite 
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asymptotic value for X, we may assume a draw-down profile of the 
form X = fee? +9 with f+g=1, 0 < f, g <1, and solve for ¥ 
from (18) and y from (19). 

We summarize the three elementary base-flow models obtained so 
far with the notation InE/2 = a: 


Unquenched base flow; Re = We = 0: 
X = e—%, WV = es, g=1. (22) 
Quenched base flow; Re ¥ 0; We = 0; 


Xx—0 for [-o: 
X=ent Wa eret y= eat, (23) 


Quenched base flow; Re = 0; We = 0; 
X finite for f — «: 
=e rg): Sa Te oy, mae e st 4g). (24) 


In addition to the inertia-dominated base flows (23) and (24), we can 
show that 


1 f 1 
x= eat + > WV — (1 —_— 2 = at) : a ex 25 
f g - ; er (25) 


is a solution of the approximate fluid-dynamic and heat-transfer equa- 
tions, for small a, if we let Re = We = O and assume 


R= poe2 G-TITo) | 


where 7) is the initial temperature and 6 >> 1. This represents a 
quenched base flow that is not inertia-dominated, in keeping with some 
of the perturbation equations discussed later on. Note that for each of 
these base-flow models da/dz = 0(aao/L). Sincea = 0(1) but ao/L<1, 
this means that |da/dz| «<1 and confirms the basic assumption pro- 
viding for one-dimensional flow, as discussed in Section III. Note also 
that the viscosity profiles in models (23), (24), and (25), which reflect 
a cooling process along the fiber, are connected with the draw-down 
profile through the parameter a. This parameter is indicative of the 
quenching rate in yn, and also controls x’, the slope of the draw-down 
profile. 

The draw-force follows from any of these solutions by the obvious 
relation, in dimensional form, 


P = ra’r, + 2aro = 8ra*uv, + rac. (26) 


If we neglect o« and substitute one of the base flows, we find that P 
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depends on do, vo, wo as well as the draw-down ratio and the quench- 
ing rate a; all of which could be expected on physical grounds. 

Clearly, we could refine upon the viscosity profiles to be used in 
simple base-flow models. Since some experimental temperature profiles 
tend to show an exponential decay and the viscosity temperature 
relation for many glasses is of the form 7 ~ e*/7, we might consider 
functions of the form 

n= r+sexp(e). 

However, such elaborations result in a loss of mathematical simplicity 
and usually lead to equations for VY that require numerical integration. 
At that point, it seems more appropriate to solve the coupled heat- 
transfer/flow problem by numerical means. This has been done and is 
documented elsewhere.4 


V. PERTURBATION EQUATIONS FOR TENSILE STABILITY MODELS 


In this section, we develop the first-order perturbation equations 
necessary for a linear stability analysis of tensile fiber models. Let the 
first-order solution of (9), (10), and (11) be denoted 


a = a(z)[1 + d(z, t)] 
bd = v(z)[1 + dz, t)] (27) 
v(z)[1 2 b(z, t) J, 
where a(z), v(z), and v(z) represent radius, velocity, and kinematic 
viscosity for a suitable base state, in dimensional form for the time 
being, and d, 4, # are dimensionless first-order perturbations of these 


quantities. 
Substituting (27) into (9), the first-order variation of the continuity 


equation reads 


uw 
I 


Is 1 ae 
Gz + 7 ai + 5b: = 0. (28) 


Similarly, (10) without the surface-tension term becomes 


, . fv v 2v | ae , {2 2v 
Ors He Ds —+--=— — td + a, —— 
v ov 3p 


v 


Z Vv 2v VUz 2v 
— ——— A + 4 y i a A eee 
By a (22 zy) /rt+e(4 a) /? 


and from (11), the momentum equation without inertia terms, 


a+etottea=%, (30) 
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where we have used a?vv, = Po = const. from (11) and d(t) is a per- 
turbation of M(t). Since the examples in later sections use the base 
states of Section IV, it is convenient to nondimensionalize the space 
and time variables as 


of = 3 Pua =7 (31) 
and use the notation 
d(C )_ ; dC) _ : 


The specific versions of (28), (29), and (30) now develop as follows. 
With the base states (22) or (23), eq. (28) becomes 


a’ + 48’ + de = 0, (33) 
Neglecting inertia in the momentum equation, (22) and (80), for the 
unquenched inertialess base state, yields 


24+¢6 ++ 48 = g(r), (34) 


where ¢ is an arbitrary function of 7. On the other hand, the momen- 
tum equation (29), using the quenched inertia-dominated base state 
(23) leads to 

a” — 46 — Ade? — Qie*8 = —25' — 40. (35) 


Turning now to the inertialess, quenched base state (25), we revert to 
(28) and (80) as basic equations, but reserve the substitution of v and 
vz from (25) for a later time. 

For some of the examples treated in later sections, it is convenient 
to eliminate 6 from (34) and (35) by means of (33) and similarly from 
(28) and (80) for base state (25). The resulting equations for d are 
recorded in Appendix C for later reference. 

Conversely, the boundary conditions for some problems demand an 
equation in §. This is the case with steady-state responses to changes 
in the takeup velocity, which we treat in the next section. If (27) is 
used in (12) and we let Q = Q(1 + 4), the first-order variation of that 
equation yields 

246+ 6 = @. (36) 


Next, consider (13), where we neglect surface tension and perturba- 
tions in v. As noted before, the constant C may be interpreted as a 
force parameter, carried from the forming zone to the draw-down 
region. Taking the first variation of (13), letting C = C(1 +0), and 
eliminating @ by means of (36), 


feet ne ey 
a= sone t 4(z =) on 


pe v 
ae 
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Nondimensionalizing the base-flow variables in this equation according 
to Section IV, we have 





p Rev. =k Ree W;\., 
p— Beg by (RE), (38) 
where 
_ Le. 
 Sazvono 


Equation (88) is designed to yield changes in the steady-state velocity 
profile, 4, as a function of ¢ and k, which are perturbations of the volume 
flux and force parameter of the base flow. 

In summary, this section has developed first-order perturbation 
equations based on the continuity and momentum equations of 
Section III. The perturbation equations were given in terms of 4 or 6 
as needed for the steady-state and dynamic-response studies to be 
pursued in Sections VI and VII. 


Vi. PERTURBATIONS OF THE BASE FLOW 


In this section, we use the time-independent versions of perturbation 
equations derived in Section V to display changes in several base-flow 
solutions due to shifts in such steady-state parameters as the boundary 
values and the viscosity profile along the fiber. Since these parameters 
are often accessible to control in real fiber-drawing processes, their 
effects on the steady flow are of operational interest. Obviously such 
effects could be determined by differencing neighboring base-flow solu- 
tions in the control-parameters space; however, exhibiting the changes 
(analytically) as first-order perturbations can yield useful insight for 
the design of feedback controls. 

We start by examining the response of steady state (24) to a change 
dz in take-up speed. Substituting the appropriate base-flow expressions 
for W and 7 into (38), one has 


e-8 en? 
o’ — Re ae 6 = ke~*x + (Re — 2f) oa q, (39) 
where 
saat ( Va) gu forty 
0d 


and a factor of 1/a has been absorbed in Re and k. The boundary con- 
dition for (39) is 


0=d, at 6=3r=a. (40) 
We find 
. kx? 2f Pe 
= V —Re| ae ac + poh doe my 
’ aie 2(1 + f) (i 1) : (41) 
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with 
kX, 
2(1 + f) 


where X, = X(zz). Given (41), the corresponding changes in fiber 
radius 4 follow from (86). 

Additional features of interest with (41) are d at z = 0 and perturba- 
tions of the draw force at either end. Thus, 


V= Eg + (1 > i) ‘| 2 ke (2 + Re/f), 


Senge ee es Y 5 aan . 
do = 0(0) = V si+ f) + (ZZ 1) q. (42) 
For the draw force, 

P= P(L+ p) = 30p@,, 
which yields 





a a ee e°sd 
P=4 a oe qd 2f 
From this, 
enn, he ee Vite k 
Po= Gt “OF ab 20 +f) (43) 
and 
A A a kx? 
pr= q+ F Te ey ae (44) 


2(1 + f)’ 


the changes in draw force, as function of dz, g, and k. 

The solution (41) is of little more than conceptual value as long as ¢ 
and k are unknown. Recall from (38) that these parameters represent 
perturbations in the integration constants of the base-flow solution for 
the draw-down region: ¢, a change in the volume flux, and k, a change 
in the force parameter of the momentum equation. Such changes must, 
in general, be expected to enter from the forming zone when the steady 
state is altered due to dz. 

Fortunately, ¢ = 0 for drawing from a preform that is fed at a 
constant rate. However, the exit flow from a crucible (Fig. la) does 
not provide such a simple condition. If operating at a low head in the 
reservoir, we would expect the entrance flow into the orifice to be 
affected by changes in the take-up speed. In neither case does there 
exist an obvious condition for the force parameter k [i.e., é and C, see 
eq. (38) ]. 

A theory of the forming zone should be able to relate 49 and jo on 
the one hand with g and & on the other. Given such relations, these 
would combine with (42) and (43) to determine @ and &k in terms of 
dz, and hence @, 4, p as functions of dz. Depending on the different 
situations depicted in Fig. 1, the relations of do, #o vs g, k in the forming 
zone could vary considerably. In some cases, an understanding of the 
complex fluid-dynamic and heat-transfer processes of the forming 
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zone (see Fig. 1d) would seem essential for a satisfactory representation 
of speed-diameter-force relations at the take-up end. 

In the remainder of this section, we examine the sensitivity of steady- 
state flow in the draw-down region to changes in the viscosity profile. 
Such changes can be viewed as consequences of perturbations in the 
heat-transfer mechanism. Our primary purpose is to determine re- 
sponse amplitudes for 4, the perturbation in draw-down profile, as 
functions of #, an amplitude parameter of the viscosity perturbation. 
Once again we encounter the problem of assuming reasonable boundary 
conditions at z = 0 without a dynamic model of the forming zone. 

We consider two different cases: the draw-down response without 
inertia effects imposed on the unquenched, inertialess base state (22), 
and the response with inertia effects imposed on the quenched, inertia- 
dominated base state (23). In particular, we shall be working with 
eqs. (76) and (77) after deletion of the time derivatives. 

In both cases, we consider a viscosity perturbation of the form 


6 = bose-73, (45a) 
which represents a distribution of arbitrary amplitude and spread, 


determined by # and y, respectively. A family of such functions is 
displayed in Fig. 4 for 1 S$ y S 6. Note that for a given value of y 
ae (45b) 


Pom ax ve ; 





the peak viscosity perturbation, normalized w.r.t. fo. 

Let us consider various boundary conditions that may be applicable 
to solutions of this problem. If we assume that the fiber is drawn from 
a preform with constant feed and take-up conditions, an obvious 
boundary condition is 

G(3,) = 0. (46a) 


In view of ¢ = 0, it seems reasonable to assume that the forming zone 
will respond with #(0) = 0. 6’(0) can be eliminated from these two 
conditions to yield 





roy «. & 0) _ 
a’ (0) (0) &(0) = 0. (46b) 
If three boundary conditions are needed, we take 
4(0) = 0, (46c) 


and then, according to (46b), also 4’(0) = 0. Finally, for a fourth con- 
dition, let 
a’ (31) = 0, (46d) 


which implies that p(%z) = 0, ie., no perturbation of the draw force 
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VISCOSITY PERTURBATION 
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Fig. 4—Profiles of the viscosity perturbation, v/v = de7%4, 


occurs at the take-up end. Conditions (46) will be invoked as necessary 
with increasing order of the perturbation equations. 

We start by considering the inertialess perturbation eq. (77). We 
have 


= ~ Bev (543) + Bot Bs (47) 
Y Y 


Results from (47) have been plotted in Fig. 5 to represent the effect 
of viscosity perturbations on the unquenched base state. Note the non- 
monotonic evolution of these curves with increasing y. 

Next, we examine the corresponding results, including effects of fluid 
inertia. Integrating (76) after deletion of the time derivatives, the 
quenched inertia-dominated base state (23) leads to 


al! +26" — 44’ — 84 = 6 + 49° +484-C. (48) 


C is a constant of integration. 
Substituting (45) into (48), we obtain 


‘s ap AVG 1 =, 
CES Garo) (3 ee )é 


+ Bye? + (Be + B33)e7?? + Bg. (49) 
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RADIAL PERTURBATION 
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Fig. 5—Draw-down response to the viscosity perturbations of Fig. 4, without 

fluid inertia, for unquenched base flow. 
Figure 6 shows (49), the effect of viscosity perturbations on the 
quenched, inertia-dominated base state, over the range 1 S y S 6. 
Note that the presence of quenching significantly alters the evolution 
of response curves in Fig. 6, which becomes monotonic with y. Typical 
amplitudes in this case are about one-half to one-third as large as for 
the unquenched base flow (Fig. 5). 

The main inference to be drawn from a comparison of Figs. 5 and 6 
is that typical peak amplitudes for G@/% are reduced significantly due 
to fluid inertia and quenching. 

By way of specific example, we consider results for y = 3.5: 


Maximum for viscosity perturbation (Fig. 4) #/%) = 0.110 
Maximum for response without inertia on 

unquenched base state (Fig. 5) G/%o = 0.018 
Maximum for response with inertia on 

quenched base state (Fig. 6) G/%o = 0.005 


The latter case, which represents the more realistic model, also predicts 
the lower response amplitudes. Thus, for example, a 10-percent de- 
parture from the nominal viscosity profile would cause only a 0.5- 
percent departure from the draw-down profile. 
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RADIAL PERTURBATION 
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Fig. 6—Draw-down response to viscosity perturbations, with fluid inertia, for 
quenched base flow. 


In summary, the first-order results given in this section yield a 
qualitative indication of base flow responses to time-invariant changes 
of the boundary conditions and of the viscosity profile. As stated in 
Section IV, a more satisfactory treatment of viscosity effects may be 
achieved by numerical integration of the base-flow equations, which 
introduce heat-transfer perturbations through the energy equation. 
However, the question of realistic interface conditions between the 
drawn-down region and forming zone remains open until the latter is 
included in our model. 


Vil. THE DYNAMIC RESPONSE OF TENSILE FIBER MODELS 


We turn now to the dynamic response of tensile fiber models for the 
unquenched and quenched base flows; i.e., we address solutions of the 
equations in Appendix C, including the time-dependent terms. Un- 
fortunately, the formulation allowing for fluid inertia, eq. (76), does 
not lend itself to a simple solution. We therefore seek what preliminary 
insight can be gained from solutions obtainable with (77) and (78), 
i.e., by neglecting inertia in the perturbation equations. 

Starting with (77), which represents perturbations of the un- 
quenched base flow, the operator on the left-hand-side suggests a gen- 
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Fig. 7—Tensile fiber model: frequency response of surface perturbation at 3, = 2 
for base state [eq. (22) ]. 


eral solution of traveling waveform 


a! = f(r + he) (50) 


and 


Ae i f(r + he-?)da + &(r), (51) 


where ®(r) is an arbitrary time function. This solution may be used 
to represent radial excitations at the source or take-up end or to satisfy 
boundary conditions in the presence of a particular solution. In the 
former case, where 4(0, +r) = sinwr, we reconstruct a solution by 
Pearson and Matovitch" of the form 


G(3, 7) = Ai(%) sin wr + Ao(3) COS wr (52) 
with the terminal response amplitude 
A (a, 51) a LAi(w, 31) + A3(a, 1) J}. (53) 


This is normalized with respect to do, the amplitude of radial perturba- 
tions at 6 = 0, and plotted, for later comparison, in Fig. 7 as a function 
of w, for 3, = 2. It shows a series of response peaks presumably due to 
the absence of quenching from the base state (22), used in (77). 
These response peaks are commonly referred to in the literature as 
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“draw resonances.” The author takes exception to this term since it 
implies the existence of natural frequencies contingent on the inter- 
action of system inertia and some restoring force, neither of which is 
obvious in the present model. Nevertheless, experimental evidence con- 
firms the occurrence of highly amplified responses near some of the 
“critical” frequencies predicted by Pearson’s model. Typical radial 
perturbation profiles for @(3, r) are shown in Fig. 8 for w = 100, at 
T = 0, the start of a period, and 7 = 0.25, its quarter-point. They 
illustrate the spatial amplification of surface perturbations occurring 
along the draw path. 

Since the direct physical realization of radial perturbations at 
d = 0 may be difficult, we now examine the effect of viscosity perturba- 
tions that are convected along the fiber as a consequence of fluctuations 
in the heat source; i.e., 


9 = g(r + 4e-%), (54, 


Substitution into (77) yields the inhomogeneous equation 
ar eae fanaa Ue) (55) 
Or 03 we 

where the dot is also used to designate differentiation with respect to 


40 


30 


N 
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ae 8a—Surface perturbation for base state [eq. (22)] with w = 100, 3, = 2, 
at T = 0. 
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Fig. 8b—é at T = 0.25. 


the compound space-time argument. If we substitute 
G! = h(r + 4e-**)e-8, 
this leads to h = —4g; hence, the particular solution 
& = —39(7 + 36) = —99. (56) 


This suggests that any space-time history of viscosity changes given 
in the form (54) translates into changes of fiber radius by the factor 
— 7. It is a traveling-wave type of response only in the sense that it is 
convected with the moving fluid.* 

To take a specific example, consider a periodic viscosity perturba- 
tion, as would be caused by misalignment of the rotating laser beam 
used in heating the preform, 


o = sinw(7 + 7), (57) 


“The negative sign may appear surprising at first. One notes, however, from the 
continuity equation (33) written as 


(e-*49/ar + 0/03)4 = —4W’, 


that 4 of the form g(r + 4724) leads to 0’ = 0. This means that such a form of radial 
perturbation can travel with the flowing fiber without perturbing the local velocity. 
Further, if a constant draw force is to be maintained, the expression 


P/p = raw’ 


shows that, with v’ unperturbed, an increase in v requires a decrease in a of half this 
magnitude. 
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where 7 = 3e7’, yielding the particular solution 


A 


Gy = —3 SIN w7 COS wn — 3 COS wT SIN wy. (58) 
The boundary conditions to be satisfied with the help of (51) are then 


5(0, 7) = 0, d(31, T) = 0, 


4.(0, 1) = }s8in w(r + 9). (59) 


In this case, the perturbations are driven only by the variations in 
viscosity. 
An appropriate form for f is 


f(8, 7) = ficosw(7r +7) — fesinw(r + 7) (60) 
so that 


G.(4, rT) = $(ficoswr — fesin wr) [Cr]? 
— $(fisin wr + fecos wr) [Si |e? + (r), (61) 
where 





: #/2 eos w 
[cine = [ dw 
on 


WwW 


#2 sin w 
[sie = [Paw 
a 


” WwW 





and f1, fe are integration constants. After determining ®(r), fi, and 
fa to satisfy (59), once again a solution of the form (52) is obtained, 
where A, and A: are recorded in Appendix D. It is interesting to note 
that A(w, 3z) for this case, if normalized in terms of #9 and plotted as 
in Fig. 7 shows exactly the same response spectrum, but with half the 
amplitudes. Moreover, the profiles of radial perturbations along the 
fiber for this case strongly resemble the ones obtained for radial excita- 
tion at 3 = 0 (Fig. 8). 

The sharp response peaks given by the above solutions at certain 
frequencies reflect the absence of quenching in the base flow (22). In 
search of some allowance for quenching effects, we consider two ad hoc 
modifications of the tensile fiber model: the base flow (25), which 
includes moderate quenching together with moderate draw-down, and 
base flow (23), which represents quenched, inertia-dominated, exponen- 
tial draw-down. 

Using eq. (78), which is based on (25), a solution of this perturbation 
equation is developed in Appendix D for the case of cyclic perturba- 
tions in the starting radius d(0, 7) = sin wr. The resulting expression 
for &(f, r) is recorded in (88). It turns out that the term e—7 sin wr 
exceeds all other contributions to the dynamic response by several 
orders of magnitude for all values of w and y of interest. Thus, the 
perturbation in the fiber radius is merely a shift in the exponential 


OPTICAL FIBER DRAWING = 1035 


draw-down profile of the base state, whose spatial variation is inde- 
pendent of w and characterized by y, the quenching parameter of the 
base-flow viscosity profile v = v evs. This perturbation oscillates in 
time with constant phase along the entire draw path. Typical profiles 
of d are shown in Fig. 9, again for T = 0, the start of a period, and 
T = 0.25, its quarter-point. A contribution from the nonexponential 
components in d(f, 7) is only seen at T = 0, the zero-crossing of its 
sin wr term. These profiles show none of the spatial amplification evi- 
dent in Fig. 8, which confirms, at least qualitatively, the attenuating 
effect of the quenching process. Unfortunately, a limiting comparison 
between the two models is not possible as the quenching effect is made 
to vanish, since that also requires a vanishing of the draw-down in base 
state (25). 

If the quenched, inertia-dominated base flow (23) is employed in 
the inertialess perturbation equation (30), the resulting model is indeed 
subject to criticism as logically inconsistent. However, as a plausibility 
argument, we might suggest that inclusion of inertia in the base flow 
would at least give a qualitative indication of changes to be expected 
from a more complete allowance for inertia effects. The formal exercise, 
starting from (23) and (80), closely resembles the derivation of (79). 
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Fig. 9a—Surface perturbation for base state [eq. (25)] with g = 0.75, w = 20, 
y =2,atT = 0. 
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Fig. 9b>—4é at T = 0.25. 


The resulting frequency response functions evolve quite clearly from 
curves such as Fig. 7 with considerable smoothing of peaks and valleys. 
This suppression of the response peaks, due to quenching, certainly 
agrees with experimental evidence. 

The main point to be made in this preliminary assessment of tensile 
fiber models is that their frequency response curves and surface per- 
turbation profiles bear little resemblance to the perturbations caused by 
surface tension, which we discuss in the next section. To generate 
more realistic response predictions for tensile fiber models, including 
inertia effects and heat transfer, we will have to resort to numerical 
means. 


Vill. THE DYNAMIC RESPONSE OF CAPILLARY FIBER MODELS 


We next inquire under what circumstances the well-known phe- 
nomena of surface-tension-driven perturbations on liquid filaments 
apply in the fiber-drawing problem. Indeed, there may be limited 
portions of the draw path, presumably near the hot tip of the forming 
zone, where the viscosity drops low enough for surface tension to be- 
come significant. At least for low-melting glasses, such as soda lime, 
this is a possibility, as born out by the comparison of essential terms in 
the base-flow equations of Section IV. We shall characterize this type 
of fluid-dynamic behavior as capillary fiber models. As we shall see, 
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their spatial response profiles under harmonic radial excitation are 
totally different from those of tensile fiber models. By all indications, 
these two kinds of filament response are fundamentally distinct phe- 
nomena. The ultimate question is to what extent they coexist and 
interact in a real fiber-drawing process. 

Rayleigh’s classical theory of capillary jets is our point of depar- 
ture.>.16 It shows that the growth rate of ‘‘varicose’ perturbations 
(axisymmetric harmonic surface modulations) is given by 


o(k? — 1) 


"= Fale +1 = BRO RGT’ a 


where 


real, the rate of growth 

= fiber radius 

= wavelength of the perturbation 
2ra/Xr 

I, and J, = modified Bessel functions. 


mre & 
| 


The denominator of (62) turns out to be negative for all &.* 

If this capillary response model is locally applied to a base-flow model 
such as (24), assuming that base-state parameters change negligibly 
over the wavelength A, we may use it to construct the dynamic re- 
sponse along a contracting fiber. Then the evolution of a small surface 
disturbance may be synthesized using the stepwise relation between 
displacement amplitudes at successive instants of time 


Oa = u, (“H+ mat), (63) 
where 


U, and Un4i = peak amplitudes of sinusoidal surface perturbations 
at t, and tn41, respectively 


Seth = stepwise scale factor due to fiber draw-down 
m = m at center of wavelength 
At = tnt = bas 


Our response simulation convects the end points of a given perturba- 
tive wavelength at their respective speeds, while computing local fiber 


* Note that (62) results from a simplification of the characteristic equation, 
neglecting inertia effects, which in turn precludes initial conditions on perturbative 
velocities. However, it can be shown that the quantitative effect of this approximation 
on m is trivial (Ref. 19). 
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Fig. 10—Response histories for different values of &. 
properties from the base model (24). We use the notation: 


X = a/ao WY = v/v n = p/bo 
U= U/U,, a= Ado, No Ao/ Qo, 


and 


Z = (8 + q)/2ao, 
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where s, g, are end points of the perturbative wavelength. The non- 
dimensionalized surface tension is assumed to vary as 

¢=¢6 pene, 
allowing for possible changes along the fiber (small values of 8 approxi- 
mate a linear dependence). 

Figure 10 shows a set of response histories in terms of log [U/X] for 
g = 0.1, Xo = 100, & = 0.25, 6 =0, no = 300, and 0.0008 S$ & 
< 0.0016, illustrating the build-up to different asymptotic levels as a 
function of @. 

Figure 11 shows typical profiles of surface perturbations along the 
fiber for \) = 15.7, which corresponds to a frequency of 200 Hz. As 
expected, the varicose response consists of sinusoids whose wavelength 
is progressively stretched due to fiber draw-down and whose amplitudes 
are modulated according to a response history such as given in Fig. 10. 
Note that this behavior differs drastically from the tensile fiber re- 
sponse of Figs. 8 and 9, which are also driven by radial harmonic 
excitation at the origin. 

Figure 12 displays asymptotic response amplitudes for a range of 
initial wave numbers 27/Xo and several values of g, the terminal radius 
in units of ao for the base flow. For fixed g and 4, the response has a 
maximum in the neighborhood of \) = 50. The low responses at short 
wavelengths are due to vanishing of the Rayleigh instability as \y)5 > 27 
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Fig. 11—Plot of deflected surface shapes for X>9 = 15.7. 
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Fig. 12—Amplification factor vs dimensionless wave number for & = 0.001. 


while for large Xo, hence large %o, the perturbations encounter a rapid 
viscosity build-up along the fiber. Contrast these response curves with 
the ones for tensile models, and a fundamental difference is again 
apparent. 

The influence of the terminal radius g, is illustrated in Fig. 13. As 
expected with a in the denominator of (62), increased draw-down 
(decreasing g) will enhance terminal perturbations. The second curve 
on that figure indicates the effect of 6, the growth rate of surface 
tension. Finally Fig. 14 shows the decrease in response with initial 
viscosity, no and the increase with rising values of &o, the initial surface 
tension. 

Since the above simulation averages fiber properties over a perturba- 
tive wavelength and does not ensure continuity of perturbative surface 
velocities between time steps, it seemed appropriate to corroborate it 
by aslightly different model, due to Tomotika,!’’!8 which is also germane 
to our situation. In Tomotika’s study, filament contraction is effected 
by a surrounding medium subjected to extensional shear flow, as in 
some of Taylor’s experiments.® This apparent difference in base flow 
and the need to let ambient viscosity approach zero for our purposes 
seems to limit the applicability of Tomotika’s model to the fiber- 
drawing problem. However, as we shall see, it agrees quite well with 
our adaptation of Rayleigh’s theory. 
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Fig. 183—Amplification factor vs g and £. 


Tomotika finds the following asymptotic expression for interface 
perturbations as a function of several model parameters: 


7 a 
log [U/X]e = oth} f, y (B)dk, (64) 
where 


& = ratio of filament to ambient viscosity 
k = 2na/d = local wave number of the perturbation 


é = nondimensionalized surface tension 
v(z) = a kernel that is detailed in the references. 
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The behavior of this expression is illustrated in Fig. 15 by plotting 
it against ko for various values of £ These plots show that, in the 
absence of quenching, the stabilizing effect of ambient shear flow 
diminishes steadily with ambient viscosity while the maximum response 
shifts to higher frequencies. This behavior is altered significantly if 
exponential changes of filament viscosity and surface tension are in- 
troduced to represent quenching effects, similar to our modification of 
Rayleigh’s analysis. Equation (64) then changes to 


" ko a ee 
log (O/*]. = soe f , u(t — Bo @ak, (65) 
0 


No 
200 250 300 350 400 450 
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) 
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Fig. 14—Amplification factor vs no and &o. 
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Fig. 15—Amplification factor vs ko for £ > © (Tomotika’s model with constant 
viscosity and surface tension). 


ne ~T(E\t  y- 
i sas a-a|(=) - 4 
0 


and () is another kernel detailed in the references. Plots of this 
expression in Fig. 16 show good agreement with Fig. 12 for large & 
(and g = 0.1). Plots of (65) for — = 10° and various values of the 
terminal base flow radius are given in Fig. 17. They show the same 
response of [U/X].. for g > 0 as Fig. 13. 

In summary, these results indicate that the two capillary models of 
quenched fiber responses, obtained by modifying Rayleigh’s and 
Tomotika’s analyses, are essentially equivalent. Note again that none 
of the response curves, such as Fig. 17, bear any resemblance to those 
of tensile fiber models. 

An additional piece of insight into capillary response mechanisms 
comes from Weber’s work.’ He reproduces Rayleigh’s analysis by a 
somewhat different approach and obtains an exact equation for m, as 
well as a simplified expression that agrees with (62). Weber shows that 
the small errors in (62) are essentially due to the neglect of radial 
components of the flow field. He demonstrates this conclusively by 
rederiving (62) from a one-dimensional representation (recorded in 
Appendix E) which captures all salient. features of the capillary re- 


where 
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Fig. 16—Amplification factor vs ky) for a = 0.001, g = 0.1, and 1 S$ — S$ 108 
(modification of Tomotika’s analysis for quenched base state). 
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Fig. 17—Amplification factor vs ko for & = 0.001, € = 10°, and 0.005 S$ g S 0.20 
(modification of Tomotika’s analysis for quenched base state). 
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Fig. 18—Comparison of capillary and tensile instabilities. 


sponse mechanism. Thus, multidimensionality is not the criterion that 
distinguishes capillary from tensile response models. The fundamental 
difference seems rather to lie in the energy source from which the 
perturbations are fed: surface tension in one case and axial stress in 
the other. 

Figure 18 attempts to emphasize this distinction in a pictorial 
fashion: (7) In the capillary model, surface tension, by overcoming 
viscous stresses, tends to accumulate fluid from both directions into 
periodic “beads,” ultimately pinching off individual droplets as the 
minimum energy configuration. A multiplicity of such separations can 
occur independently of each other, and the essential physical mecha- 
nism is equally as valid for stationary, uniform filaments as it is for 
contracting base flows. (iz) In the tensile model, on the other hand, 
the ‘worst’? among random localized constrictions, due to surface 
perturbations, causes a tensile stress concentration which further 
reduces the cross-sectional area and results in a single, “run-away”’ 
tensile separation. This is the familiar necking of any tensile test. 
specimen. The tensile stress associated with draw-down in the base 
flow is an essential prerequisite for this mechanism. Surface tension 
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will indeed contribute to tensile resistance in the necked down section, 
but it does not fundamentally alter the separation process. 

Thus, the capillary mechanism is a contest between surface tension 
and viscosity, independent of filament draw-down, whereas in the 
tensile mechanism the viscous stresses, jointly with surface tension, 
attempt to resist the draw force. 


IX. SUMMARY AND CONCLUSIONS 


Several important observations result from the discussions in the 
preceding sections. 


(2) 


(72) 


(iii) 


Optical fiber drawing differs from textile fiber “spinning” in 
several essential ways. The flow of glass in the forming zone 
and draw-down region is dominated by viscous stresses. 
Inertia and surface tension play secondary roles. (They be- 
come noticeable only for the lower-melting glasses and then 
only in limited portions of the flow field.) This contrasts with 
polymer fiber forming, where, in some instances, viscosities can 
be quite low and other effects may be of comparable importance. 
Moreover, glass fibers are quenched over a wider range of tem- 
peratures than polymer fibers. This, together with the extreme 
temperature-dependence of glass viscosities, causes viscosity 
profiles along the draw path to rise much more abruptly than in 
textile fibers. Finally, and perhaps most importantly, molten 
glass can be considered very nearly Newtonian, which is not 
true for most polymers. Given the above physical features and 
the small-slope assumption of gradual draw-down, we can 
justify one-dimensional base states as useful representations of 
steady flow in the draw-down region. 

Given a base-flow model, its sensitivity to changes in operating 
conditions, such as the take-up speed and temperature profile, 
can be estimated by first-order perturbations. It turns out that 
the draw-down profile is relatively insensitive to significant 
viscosity changes, assuming that interface conditions between 
the draw-down region and forming zone have been modeled 
correctly. For nontrivial forming zones, e.g., Fig. la, b, or d, it 
is difficult to make reasonable assumptions for these conditions. 
Since we lack a complete understanding of the forming zone, 
but expect its flow field to change with perturbations in the 
draw-down region, our results must be considered tentative. 
Fundamental differences exist between the tensile and capillary 
models of dynamic fiber response. The tensile mechanism seems 
to prevail in most of the draw-down region. For low-melting 
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glasses, we conjecture that the capillary model may apply in 
the short and very hot transition between forming zone and 
draw-down region, where surface tension can sustain perturba- 
tions that subsequently propagate by the tensile mechanism. 
The interactions of these two phenomena and their relation to 
dynamic responses in the forming zone itself are presently not 
understood. In the following we amplify each of these points to 
some extent. 


The base-flow models we supplied in Section IV are admittedly 
qualitative. A heat-transfer analysis was circumvented in this pre- 
liminary study by assuming exponential viscosity profiles, suggested 
by qualitative experimental evidence. Exact solutions of the coupled 
one-dimensional momentum and heat-transfer equations are now being 
carried out to allow for different kinds of heating in the forming zone 
and various cooling mechanisms in the draw-down region. The resulting 
simulation will be able to provide more detailed operational trade-offs 
between steady drawing parameters. It will also assess the limited in- 
fluence of fluid inertia, surface tension, and gravity. Finally, this 
modeling effort presents a natural opportunity for experimental cor- 
roboration by suitably instrumented steady-state runs, using laser 
and/or furnace-heated preforms or crucibles. 

As an extension of one-dimensional base-flow models, radial-heat- 
transfer mechanisms should be simulated, leading to nonuniform cross- 
sectional viscosity distributions at the start of the draw-down region. 
These viscosity distributions must be input to a perturbation model of 
axisymmetric free surface flow which generates the nonplanar velocity 
profiles expected in the transition between forming zone and draw- 
down region. Ultimately, the detailed flow fields of forming zones such 
as Fig. 1b and d may have to be simulated by discretization techniques. 
If properly combined, these efforts may, hopefully, result in a unified 
base-flow model that properly allows for interactions between the 
draw-down region and forming zone in representing steady-state re- 
sponses to changes in the control parameters of the draw process. 

Finally, as mentioned before, it appears that vastly different fre- 
quency response curves and longitudinal profiles of surface perturba- 
tions characterize tensile and capillary dynamic responses as fundamen- 
tally distinct physical mechanisms. (Note the intuitive distinctions 
given at the end of Section VIII.) They do not seem derivable, in proper 
relation, from some universal fiber stability analysis. The question is 
then, what must be done to develop them into parts of a realistic and 
unified dynamic response model. 

Since analytic solutions for tensile responses of nontrivial base flows 
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in the draw-down region are not possible, numerical solutions by modal 
analysis or space-time integration are being attempted. This is ex- 
pected to shed further light on the anti-intuitive response profiles ob- 
tained from the ‘“‘inertialess’” perturbation equations. Similarly, the 
capulary model applied so far to an elementary base flow may be im- 
plemented, by numerical means, for conditions representative of the 
transition between forming zone and draw-down region. Combining 
these extensions of the tensile and capillary response models, it may 
be possible to relate dynamic records of thermal or mechanical surface 
perturbations coming out of the forming zone to diameter variations in 
the finished fiber. 
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APPENDIX A 
Implications of the Small-Slope Approximation 


We briefly sketch the consequences of the small-slope approximation, 
|az| <1 and |u/v| <1, as they evolve from (1), (2), (3), and (4) by 
scaling arguments. Let 


v= vw, U= UY, w= Ho, P= Pot (66) 
a = aX, r = aoé, z2=Lt, and t= 5 
0 


Then the small-slope assumption amounts to 


=, Go\ _ 
=0(9) e<l. 


Substitution of (66) into (4) yields 


= el Pal, + 20 | & = Hoo 
ov 


Uo 
Vo 








saan L 
i Bee Pol i+ ny; a OED ae 
L Lovo L (67) 
L 
ry = He) — Bs pee et 2ne/t| = ae is 


Lovo . Movo . 
Tre = Hy [ Dy cer | Re on 


where poL/uovo = O(1) and will be omitted henceforth. Substituting 


OPTICAL FIBER DRAWING 1049 


(66) and (67) into (8), 


lL, + (ah) on + 90 


11 
= (—e + 2nV¥s)¢ + dt Cén(We + e yy) Je. 
Considering the case pvol/uo = Re <1, 


0 = &(-1 + 2n¥;); + : Lén(We + ey) |. 


Hence, 
(énVe)e = 
If no constraint is to be imposed on y, we have 
Vv; = 0 (68) 
and 


Tre = _ eng; = OLe(r2, Tr Te) ]- 
Now, from (1), 


t+ (= =) (oe + #/8) = 0 
and because of (68) 


ge = o/§ = —Z¥y. (69) 
Then, 
Tr = To = —ct— hy (702) 
€ 
in = — 5 EV ss 
and 
7, = —et QnYy. (70b) 


tis determined from the normal stress condition at the fiber surface 
& = X. In dimensional form, 


a(adz, — a2 — 1) 


a(1 + az)? 


Substituting (66) and (70) into (71) and dropping terms of 0(¢?) and 
higher, as well as a,,, one finds 


(71) 


Te + Ot, — 20:Tr = 


— nV; (72) 


for all values of £, where ¢ = oL/povodo. With this result, 7, from (70a) 
becomes 


t= zy 3n¥;, (73) 
where 387 constitutes the ‘“‘Trouton” viscosity. (The additional 7¥; 
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term, augmenting 27; in (70), originated in 7, of (71). This, in turn, 
is due to ¢; of (67), the cross-sectional contraction that necessarily ac- 
companies the extensional flow of fiber drawing.) We also note from 
(72) and (70) that 

(eh ae Site = To, = 0; (74) 
which agrees with the radial equilibrium equation if (66), (68), and 


(70) are used in (2) and we let Re < 1. 
The tangential surface stress condition at & = X reads 


az(tr — tz) + (1 + a2)7,, = 0 (75) 


and, if rendered dimensionless, has leading terms of 0(e) ; hence, it will 
be ignored. For completeness, we also note that the kinematic boundary 
condition (7), rendered dimensionless and time-invariant, yields 


VY; OX, 


Vv x’ 
which is the time-invariant continuity equation (9). 


APPENDIX B 
Second-Order Perturbation Term for the Base-Flow Solution [Eq. (17)] 


ala bodtl=ct+S+at Feo) +h ta+F) 


mata the 


1 
Ink 


+ (1 + d) (F ersink + Qabe (Unr#l2)s ps me bad 


ee (1 ta r) (aetlnE — Qhe-Unz!2)t) 


+ a esslnk + — = e BlnE/2)F i, b§ e7 cmt 
18 9 
d2 
“Ble (ita+5) 
+(F+2b-F)a+g+ E+ Pts 
, {a Sead) | ar (ne, t) + OF (- 7 °)| 
+ © F(2Ink, $) + abF (4 S) +5 +2 F(—InE, t) 


+§ | 2G (me, t) + a ae s)|f 
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where 


F(y,$) == aK er eee 4] 


Gn) = 2[(#+ 243) en — 3]. 


and 


APPENDIX C 

Differential Equations for First-Order Dynamic Perturbations in the Radius, a 
Using (83) to eliminate 4 from the first-order variations of the 

momentum equations, we obtain from (85), which reflects the 

quenched, inertia-dominated base state (23) with inertia effects in the 

perturbation equation, 


GU LA e-23 426!" — 44°%e-2 — 24°e-48 + Bae 42 
— 44” — 84’ = 49’ mi Ap! + pl", (76) 


From (84), the inertialess momentum equation, and the unquenched, 
inertialess base state (22), we obtain 


Gl" + Ge = 9” (77) 


Note in (76) and (77) that ( )’ = 0( )/d3 and ( _)* = 0(_ )//@r, 
where 3 = az/L and 7 = avot/L. Finally, from (28) and (30), the 
inertialess momentum equation with the quenched inertialess base 
state (25), we can find 


(FER) Wren -Gin a 


where y = v/vo as in (25), ¢ = 2/L and +r = tvo/L. In this case, 
( =a )/ag. 


APPENDIX D 
Detailed Results for Forced Dynamic Responses of the Tensile Fiber Model 





The detailed expressions for A; and A» resulting from (58) and (61) 
are 


of 4 (1 — €)w 
Aim |5 Cool + xara Bp 


xX [C7123 — (Bi cos w/2 + Bz sin w/2) X psi] 


X [(—Bisin w/2 + Be cos w/2) 


As= {5 [sin]? + + ete X [ (Bi cos w/2 + Bz sin w/2) 


X [C123 + (—Bisin w/2 + Bz cos w/2) X psi] ' 
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where ¢ = e?%z and 
w w/2 
Be = | ci + 9, 82 + cos | 
“a (79) 


B,= si ~£ci+ sin | 
Recall that 6 = az/L. 


w/e 


We now parallel the development of Section VII for the quenched 
base flow (25). Equation (78) with ¢ = 0 suggests a solution of the form 


a({, 7) = Re {if f expla({’ — ¢)] exp[tw(r — &) Jdg’ 


+ 9 exp(twr — af)} , 


where 
v dg* : 
g= I are fo = fi t+ tho, Dy = D, + 182. (80) 
o v(g*) 
Explicitly, 
G({, 7) = Aisin wr + A2 cos a7, 
where 


t 
Ai(g) = i ee 8’) (f, sin wi — fe cos wé)dg’ — Be %, 
0 
(81) 
A2(f) = i] ee GO (fF, cos wi + fo sin wh)dg! + byeW%, 
0 
describing the ¢-dependent phase and amplitude of the response. Note 
that ¢ = 2/L and r = tvo/L, whereas in (79) 6 = af and +r = avot/L. 


We use the same boundary conditions as in developing (52). For 
cyclic radial perturbations at ¢ = 0, 


4(0, r) = sin wr, | (82) 
together with the velocity conditions 
o(0, 7) =0 o(1, r) = 0. 
An expression for é is obtained by eliminating é’ from (28) and (30) 


p= 5474 554 ROGR, (83) 
where G(r) is an arbitrary time function and the differentiation symbols 
mean ( )*'=0( )/dr,( )’ = 0(_ )/dg. Substituting (81) into (83) 
we find 

o(f, 7) = Visinwr + V2cos wr 
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with 
—2w 


MO = fy "eat (f, cos we + fein wE)ds" 
's : 
—2 , et 8-1) (f, sin wi — fe cos wé)dt’ 
+ 2e-% | (Fe + 1) Po — | 
2y . 
+ y (fi sin w& — fe cos wt) + 2G, 
Vals) = 22 [" ex'-D (4, sin wf — fo cos wt)a 
2 yp’ 1 (2) 2 (e) 
—2 i et&'’—O (Ff; cos wé + fe sin wé) dg’ 
0 
nsf pnt (ber)o] 


+ (frcos wt + fesin wf) + 2G. (84) 
Ultimately, (82) leads to 


©, = 0, ®, = —1 
Tt = es + B2C2)/ (Bi i B3 3) fe = (fi:Be = C2)/Bi 


G, = +5 (a + fe cos wo — fi sin wo) (85) 


G=- 5 (w + ficos w&> + fe sin £), 


B, = pd) - K(1) +¥ ” COS wk —1 


Bie — 7 KO) — HQ) + Ssin wf: (86) 
a a ae 

a aes 
= V1 ee — FL 

C2 (Ge +1)e Yo 1. 


The subscripts 0, 1 denote evaluation at ¢ = 0, 1 respectively, and the 
quantities H, K are defined as 


H(f) = ia ex %'—8) sin wédt! 


t (87) 
K(g) = I, ee’) cos wide’. 
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With this notation, we ultimately get 


4(§,7) = [AHO — fe (O) + e-*] sin wr 
+ [fk + fe (g)]coswr. (88) 


APPENDIX E 
Weber’s Derivation of the Capillary Stability Equation 


This appendix gives a simplified derivation of the stability equation 
for the capillary fiber model based on assumptions that are quite 
equivalent to those made for the one-dimensional tensile model. In 
fact, the rationale used here closely parallels that of Section III. 

The analysis proceeds in terms of equilibrium and continuity equa- 
tions, which we write in dimensional form for the entire filament cross- 
section. The perturbed surface radius becomes a + 6 and a given cross- 
sectional element is displaced by vdt along the fiber over the time 
increment dt. The radius of this element now becomes 


06 06 
até+t di + = odt. (89) 


Since the last term is of higher order, it will be neglected. 

The constitutive relations are as in (4) and the derivation of an ex- 
pression for 7, is quite similar to Appendix A. The main difference arises 
in the radial stress boundary condition, where, in distinction from the 
treatment of (72), the longitudinal curvature term cannot be neglected 
for varicose perturbations. Then the r.h.s. of (72) becomes 


6 ’ 

; (5 +6 ) (90) 

and, instead of (74), 
Tt =o (a + i") + 3yv’. (91) 

The continuity equation yields 

ay ean ty (92) 

a 
Now, combining (9) and (10), 


T, = pd + pov’, (93) 


where the last term was apparently overlooked by Weber but seems to 
have little effect on the resulting stability equation. 

Substituting (91) and (92) into (93) and assuming surface perturba- 
tions of the form 6 = 6*e™ cos kz/a, leads to the stability equation 








(8u) + o wont 
2 a ee — p2)\p2 
m? + m ode = Dal (1 — k?)R?. (94) 
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The unstable root, of interest here, is 











_ _ (Buk? o rot. (Su)*k* 3 
m= — Gh + | 5 a — Bye + Gare | (95) 
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An Injection-Molded Plastic Connector for 
Splicing Optical Cables 


By A. H. CHERIN and P. J. RICH 
(Manuscript received February 24, 1976) 


An injection-molded plastic splice connector for splicing optical cables 
has been fabricated and evaluated. Five optical cables containing 90-um 
OD graded-index fibers with 56-um core diameters were spliced, yielding 
an average splice loss of 0.20 dB for the 425 splice joints measured. 
Fifty percent of the losses measured were less than 0.1 dB and 95 percent 
of the splice joints had losses less than 0.8 dB. 

Assembly methods for splicing optical cables using this connector and 
a multiribbon optical-fiber cutting tool capable of cutting 144 fibers semul- 
taneously are also described. 


I. INTRODUCTION 


The feasibility of splicing groups of optical fibers in a laboratory 
environment has been demonstrated by a number of investigators.!~5 
The next phase in the development of optical-fiber splicing is to pro- 
duce splice connectors, based on the concepts that have shown labora- 
tory feasibility that are adaptable to field use. A field-adaptable 
splicing technique will require that telephone crafts people be able to 
splice groups of optical fibers in a routine fashion, with relatively 
simple tools, in a hostile field environment. 

In this paper, an injection-molded splice connector fabricated using 
a mold designed to optimize reproduction of mold dimensions is 
described and evaluated. Assembly methods for splicing optical cables 
using this connector and a multiribbon optical-fiber cutting tool 
capable of cutting 144 fibers simultaneously are also described. 


Il. DESCRIPTION OF SPLICE CONNECTOR AND PRECISION-MOLDING 
TECHNIQUES 
A precision metal mold was used to fabricate a 12-ribbon, multi- 
groove substrate with prealigning slots. The molded plastic substrate 
which forms the base for the optical cable splice connector is shown 
in Fig. 1. It consists of twelve sections. Each section has a prealignment 
slot and a set of twelve fiber-alignment grooves spaced 90 um apart. 
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Fig. 1—Precision-molded substrate for 12 X 12 optical-fiber splice connector. 


The splice is made by properly seating a precut set of ribbons into 
grooves and sliding them together to form a butt joint. A coverplate 
is attached to the substrate and matching material is injected through 
a slot in the coverplate to complete the splice. The completed splice 
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Fig. 2—Histogram of spacing between the internal grooves of metal master. 
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TOTAL POPULATION—480 MEASUREMENTS 
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Fig. 3—Histogram of spacing between the internal grooves of molded polycarbonate 
substrate. 
connector can Join two optical cables, each consisting of twelve ribbons 
that house twelve 90-um op fibers. 

The transverse alignment of the fibers in the grooves of the connector 
is a critical parameter in attaining low-loss splices. Tight tolerances 
were placed on the center-to-center spacing between the grooves. The 
design tolerance for the center-to-center spacing was 3.55 + 0.05 mils. 
Less stringent tolerances were placed on the depth of the grooves. 

The metal master was measured in an optical toolmakers micro- 
scope to determine how well it was machined. Figure 2 is a histogram 
showing the spacing between the internal grooves. The average groove 
width was 3.51 mils. Using the metal master in a screw-injection 
molding machine, a number of polycarbonate substrates were fabri- 
cated under different molding conditions. Samples were randomly 
selected from a batch that was molded to replicate the master as 
closely as possible. Figure 3 is a histogram showing the spacing between 
the internal grooves of the molded polycarbonate substrate. The 
average groove width was 3.49 mils compared with 3.51 mils obtained 
for the metal master. 

Figure 4 is a derived cumulative distribution function of Figs. 2 
and 3 showing a comparison of the groove-width dimensions for the 
master and plastic part. A very small amount of shrinkage, less than 
0.1 mil, appears to have occurred in the plastic substrates. Measure- 
ment repeatability in obtaining this data with the toolmakers micro- 
scope was +0.05 mil. 
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Ill. SPLICE CONNECTOR ASSEMBLY TECHNIQUE 


The splicing of optical cables requires the integration of a number 
of operations including stripping of the cable sheath, ribbon prepara- 
tion, removal of the plastic coatings from the fibers, fiber-end prepara- 
tion, and, finally, the assembly and protection of the splice connector 
itself. Approximately 1 hour and 45 minutes is required to splice two 
optical cables consisting of 12 ribbons (each containing 12 fibers) 
together with the molded connector. Using current techniques, the 
majority of this time (about 1 hour) is spent stripping the ribbons 
and assembling them in the fiber organizers. Fiber-end preparation 
using the multiribbon cutting tool described in the Appendix requires 
about 15 minutes to prepare both ends of the cable. After the 144 
fibers have been cut, the organizer is removed from the cutting tool 
and clamped to a micropositioner stage in preparation for insertion 
into the substrate of the splice connector. As shown in Fig. 5, tapered 
prealignment combs allow the ribbons to be lowered into the prealign- 
ment slots and grooves of the substrate. A mechanical wiper is attached 
to each organizer and is used to massage the fibers to assure that they 
are seated properly in their grooves. After the wiping process is com- 
pleted, epoxy is used to permanently fix the wipers in place. The splice 
is closed by means of an assembly that enables the coverplate to be 
tacked in place with extra-fast-setting epoxy. An epoxy index-matching 
material is then injected through the slot in the coverplate to com- 
plete the splice. To assemble the connector itself requires only 30 
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Fig. 4—Cumulative distribution function of groove widths. 
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Fig. 5—Splice with coverplate epoxied in place showing expanded view of one 
ribbon joint. 


minutes. Since splicing is a parallel operation, all twelve ribbons are 
spliced simultaneously. 


IV. EVALUATION OF SPLICE CONNECTOR 


Using Corning Glass Works graded-profile 90-um op fibers with 
55-um core diameters, adhesive sandwich ribbons® were made and 
formed into short prototype cables for the splicing studies. Following 
the procedures outlined in the previous section, five different cable 
splices were assembled and measured. Included in the statistics quoted 
were all ribbon-to-ribbon splices with twelve fibers present at the 
splice joint. When fiber breakage occurred, ribbon-to-ribbon splices 
with less than twelve fibers present were included in the statistics if 
proper alignment was maintained. Figures 6 and 7 show, for the 425 
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Fig. 6—Histogram of total splice loss data. 


splice joints measured, the histogram and derived cumulative distri- 
bution function of the total splice loss data taken. The average splice 
loss was 0.20 dB with a standard deviation of 0.32 dB. Fifty percent 
of the total losses measured were less than 0.1 dB and 95 percent of the 
splice joints had losses less than 0.8 dB. Five additional outliers, not 
shown in the histogram but included in the statistics, had losses of 
1.60, 1.64, 1.79, 2.08, and 2.66 dB. Four of these high-loss splices 
occurred in one of the cable splices. 
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Fig. 7—Derived cumulative distribution of total splice loss data. 
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To determine the loss in a splice joint, an input beam with a numeri- 
cal aperture approximately equal to that of the fiber was used and the 
input and output power to the splice joint was measured. This tech- 
nique for measuring the loss in a splice joint has been described in 
detail in a previous paper.’ The detector used in this study consisted 
of a United Detector Technology pin long-line photodiode housed in 
a special glycerin-immersed fixture built to accommodate a ribbon 
organizer. To maintain accuracy in the splice loss measurements when 
measuring large groups of fibers (144 fibers in a linear array), it is 
necessary to establish accurate positioning of individual fibers on the 
surface of the detector. Variations in the sensitivity, as a function of 
position on the active surface of a large area detector, can cause errors 
in the measurements which are greater than 0.1 dB. 


V. REQUIRED IMPROVEMENTS AND DISCUSSION 


To maintain a high splice yield with this method of parallel splicing 
of large groups of optical fibers, 12 contiguous fibers must be present. 
If fibers are broken in the ribbons during ribbon stripping, fiber 
organizing, or end-preparation processes, gross misalignment (>10-4m 
transverse misalignment) can occur at the splice joint. The small 
alignment grooves shown in Fig. 8 do not provide adequate guidance 
unless the 12 contiguous fibers are present to force partial alignment 
of the fibers in the connector. 


—~ 


CURRENT DESIGN 





mononcnom 


Fig. 8—Alternate groove depths. 


ALTERNATE DESIGNS 
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Typically, two fibers per cable end are broken in the ribbon stripping, 
cleaning, and organizing processes. The fiber-end preparation process 
yields about 99.7 percent efficiency. Thus, three to five ribbons within 
the group of twenty-four ribbons being spliced in a connector have 
less than twelve surviving fibers and have the potential for being badly 
misaligned. 

The development of automated ribbon-stripping techniques and 
better fiber-handling methods will improve the yield of this process. 
It is very probable, however, that some fibers will break. To prevent 
high splice losses in an entire ribbon, if breakage occurs, requires a 
redesign of the molded connector. Increasing the alignment groove 
depth as shown in the connector designs of Fig. 8 will tend to provide 
guidance for individual fibers independent of the ribbon structure. 
When guidance of this type is achieved, the breaking of an individual 
fiber will not affect the alignment of the remaining fibers in a ribbon, 
and splicing performance will be greatly improved. 


APPENDIX 
A Multiribbon Optical-Fiber Cutting Tool 


The production of low-loss splices between optical fibers or the 
splicing of groups of optical fibers in the form of fiber ribbons and 
cables requires a reliable and convenient method of fiber-end prepa- 
ration. Two basic techniques of end preparation have been developed 
and are described in the literature. The first, a conventional grinding 
and polishing technique, has been used by Miller? and Cherin! in the 
splicing of optical-fiber cables and ribbons. This technique of end 
preparation could be utilized in a controlled environment to prepare 
the ends of factory-installed cable connectors. The second method of 
fiber-end preparation requires the controlled fracturing or breaking of 
fibers as developed by Gloge et al.” A simple cutting tool for preparing 
the ends of individual fiber ribbons has been used by Chinnock et al.* 
and Cherin and Rich! with excellent results. A properly engineered 
tool of this type seems well-adapted for use under field conditions. 

In this Appendix, we briefly describe the design of a cutting tool 
that, operating on the principle described by Gloge et al.,” is capable 
of cutting 12 fiber ribbons (144 fibers) simultaneously. The cutting 
tool has been designed to be compatible with the injection-molded 
splice connector described in this paper. 


A.1 Cutting tool and ribbon organizer 
The fiber-cutting tool, shown in Fig. 9, consists of four basic parts. 


(z) A precision diamond-tip-stylus scoring assembly used to create 
a crack or origin of fracture on the outer surface of the fibers. 
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Fig. 9—Fiber ribbon cutting tool. 


(it) A polished curved surface over which sets of fiber ribbons are 
securely clamped. When the fiber ribbons are stressed over this 
surface, the stress distribution necessary to form flat hackle- 
free ends on the fibers is created. 

(zit) Clamps to secure the fiber ribbons during the scoring and 
stress-application portions of the cutting process. 

(tv) A precision screw, which displaces a clamp and causes the 
application of a longitudinal stress within the fibers. 


A ribbon organizer is used to hold 12 ribbons in the form of a linear 
array. The organizer is clamped to the cutting tool and holds the 
fibers securely in place during the cutting process. After the 144 fibers 
have been cut, the organizer is removed from the cutting tool and is 
ready for insertion into the cable-repair splicing fixture. 


A.2 Cutting experience 


To date 52 cable ends have been prepared using the cutting tool. 
The nominal cable consisted of 12 ribbons each containing 12 fibers. 
Planar ends have been made on 99.67 percent of all the fibers that 
have been cut, 7328 out of 7352 (a few of the ribbons had less than 
12 fibers within them). Typical fiber ends that were prepared using 
the tool are shown in Fig. 10. The total cutting efficiency of the tool 
was determined by the number of fibers surviving the entire process 
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Fig. 10(a)—Optical ribbon with prepared fiber ends. (b) Typical fiber end. 


of clamping, scoring, and tensioning. A total of 7283 fibers success- 
fully survived the entire process, yielding a cutting efficiency of 
99.06 percent. 
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Digital Coding of Speech in Sub-bands 


By R. E. CROCHIERE, S. A. WEBBER, and J. L. FLANAGAN 
(Manuscript received March 26, 1976) 


A rationale is advanced for digitally coding speech signals in terms of 
sub-bands of the total spectrum. The approach provides a means for 
controlling and reducing quantizing noise in the coding. Each sub-band 
ts quantized with an accuracy (bit allocation) based upon perceptual 
criteria. As a result, the quality of the coded signal is improved over that 
obtained from a single full-band coding of the total spectrum. In one 
implementation, the individual sub-bands are low-pass translated before 
coding. In another, ‘‘integer-band” sampling ts employed to alias the 
signal in an advantageous way before coding. Other possibilities extend 
to complex demodulation of the sub-bands, and to representing the sub-band 
signals in terms of envelopes and phase-derivatives. In all techniques, 
adaptive quantization is used for the coding, and a parsimonious allocation 
of bits is made across the bands. Computer simulations are made to 
demonstrate the signal qualities obtained for codings at 16 and 9.6 kb/s. 


I. DIVISION OF SPEECH SPECTRUM INTO SUB-BANDS 


For digital transmission a signal must be sampled and quantized. 
Quantization is a nonlinear operation and produces distortion products 
that are typically broad in spectrum. Because of the characteristics of 
the speech spectrum, quantizing distortion is not equally detectable 
at all frequencies. Coding the signal in narrower sub-bands offers one 
possibility for controlling the distribution of quantizing noise across 
the signal spectrum and, hence, for realizing an improvement in 
signal quality. In earlier work, splitting of the spectrum by high-pass 
and low-pass filtering has been used advantageously for video and 
speech transmission.!? 

A question, then, is what design of sub-bands makes sense for 
speech coding? A choice based upon perceptual criteria is suggested, 
namely, band-partitioning such that each sub-band contributes 
equally to the so-called articulation index (a1). The a1 concept is 
based upon a nonuniform division of the frequency scale for the speech 
spectrum. Twenty nonuniform contiguous bands are derived in which 
each elemental band contributes 5 percent to the total ar. 
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Appealing to this notion, one partitioning of the frequency range 
200 to 3200 Hz into four “equal-contribution’”’ bands is given below 
and shown in Fig. 1. 


Sub-band Frequency Range 
Number (Hz) 
1 200-700 
2 700-1310 
3 1310-2020 
4 2020-3200 


Each sub-band in its original analog form contributes 20 percent to AI. 
The total a1, therefore, is 80 percent, which corresponds to a word 
intelligibility of approximately 93 percent.‘ 


Il. LOW-PASS TRANSLATION OF SUB-BANDS 


A straightforward approach to processing the sub-bands is to make 
a low-pass translation before coding. This facilitates sampling-rate 
reduction and realizes any benefits which might accrue from coding 
the low-pass signal. 

The low-pass translation can be accomplished in a variety of ways. 
One method is shown in Fig. 2. The input speech signal is filtered with 
a bandpass filter of width W, for the nth band. W1, is the lower edge 
of the band and W:, is the upper edge of the band. The resulting 
signal s,(t) is modulated by a cosine wave, cos (Wint), and filtered 


AMPLITUDE !N dB 





FREQUENCY IN kHz 


Fig. 1—Partitioning of the speech spectrum into four contiguous bands that con- 
tribute equally to articulation index. The frequency range is 200 to 3200 Hz. 
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Fig. 2—Sequence of operations for low-pass translation of speech sub-bands, 
adaptive pcm encoding, transmission, decoding, and band restoration. 


by a low-pass filter h,(é) with bandwidth (0 — W,). This filter is 
necessary to remove the unwanted signal images above 2Win, as 
shown in Fig. 2. The resulting signal r,(¢t) corresponds to the low-pass 
translated version of s,(¢) and can be expressed in the form: 


ra(t) = [sn(é) cos (Want) |*ha(d). (1) 


Notice, in this instance, that a constraint is implied by the convolution, 
namely, that the passband width W, S 2Win, or that Wen S 3Wain. 
Practically this poses no problem.* 

The signal r,(¢) is sampled at rate 2W,. If it is already in digital 
form, the sampling rate is decimated (reduced) to the rate 2W,. This 
signal is digitally encoded and multiplexed with encoded signals from 
other channels as shown in Fig. 3. At the receiver the data is demulti- 


*For example, this constraint requires that W be increased slightly, from 200 to 
233 Hz, for n = 1 in Fig. 1 
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Fig. 3—Four-band encoder using iow-pass translation and apcm encoding in each 
band. 


plexed into separate channels, decoded, and interpolated to give the 
estimate 7,(¢) for the nth channel. Reconstruction of the detected 
signal is simply done by the reverse band translation. That is, it is 
modulated by cos (Wint) and bandpass filtered to the original pass- 
band, as shown in Fig. 2. The sub-band signal 5,(¢) is then summed 
with the other bands to give the full-band signal F(t). 

An alternate implementation of the low-pass translation method, 
which avoids the above-mentioned restriction on Wn, follows from a 
modification of the complex demodulation process. In this approach, 
s(t) is complex modulated by e#*"Lwn = (Win + Won)/2 = center 
frequency of band n] and filtered by a low-pass filter h,(é) with band- 
width (0 — W,/2). The resulting complex signal an(t) + jba(é), 


an(t) = [s(t) cos wat |*ha(t) (2a) 
ba(t) = [s(é) sin wat |*h,(A) (2b) 


corresponds exactly to the output of the phase vocoder.’ The conjugate 
of this signal an(é) — jba(é) corresponds to a modulation of s(t) by 
e~ nt, Tf the complex signal a,(t) + jba(é) is complex modulated by 
e4(Wnl2)t and its conjugate complex modulated by e7"/2), the two 
resulting complex signals correspond to the negative and positive 
frequency components of the low-pass translated signal r,(¢), as shown 
in Fig. 4. The sum of these two signals gives a real signal corresponding 
to the desired low-pass translated signal ra(¢); i.e., 


ra(t) = [an(t) + jon ()Je#""* + Lan(t) — joa) Jerirnlrre, (8) 
or 


ra(t) = 2 | ent ee (> t) + ba(t) sin (> ‘)|: (4) 


For reconstruction, it can be shown that an(t) and 6,(t) can be re- 
covered from the low-pass translated signal r,(t) by the following 
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relations 


an(t) = [ra(t) cos (Wat/2) |*h,(t) (5a) 
bn(t) = Era(é) sin (Wat/2) |*h, (0). (5b) 


Iiquations (4) and (5) suggest a method of implementation of the 
low-pass translation and reconstruction with a phase vocoder. For a 
digital implementation of the low-pass translation, this approach is 
particularly appealing. For example, at the sampling rate f, = 2W,/2z, 
the sequences corresponding to cos (Wnt/2) and sin (Wnt/2) are 1, 0, 
—1,0,1, ---, and 0, 1, 0, —1, 0, ---, respectively. Therefore, an effi- 
cient way to generate r,(t) is to sample a, and b» (or decimate if they 
are in digital form) to one half of this sampling rate (i.e., W./2m) and 
form r,(t) by interleaving samples of a, and b, (with appropriate 
sign changes). A similar approach can be used in the reconstruction 
process by recognizing that alternate samples of r,(¢) cos (W.t/2) and 
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Fig. 4—-Frequency-domain interpretation of complex demodulation method for 
low-pass translation. 
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ra(t) sin (W,t/2) (at sampling rate 2W,/27m) are zero valued. Thus, 
the two input sequences to the interpolators (which can be sampled 
at half of this rate or W,/27) can be generated by selecting alternate 
samples of r,(t) (with appropriate sign changes). 

A further modification on this approach can be made by noting 
that, since adaptive coding is used to encode r,(t), the sign changes 
in the construction and separation of r,(t) are not necessary. That is, 
an alternate sequence r,(t) can be generated by interleaving samples 
of a, and b, without sign changes. This sequence can be encoded and 
decoded and inputs to the interpolators can be formed from alternate 
samples of 7,(¢) (without sign changes). Figure 5 shows an implemen- 
tation of this method. The signal s(é) is modulated by cos w,t and 
sin wnt, where w, is the center frequency of band n. These signals are 
filtered with low-pass filters h,(é) with bandwidth (0 — W,/2). The 
outputs are decimated (if they are in digital form) or sampled (if 
analog) at a sampling rate Wn. The low-pass translated signal r,(t) 
is obtained (at sampling rate 2W.) by interleaving samples of a, and 
b,. T(t) is encoded, transmitted, and decoded as in Fig. 3. On recon- 
struction a, and b, are recovered by selecting alternate samples of 
F(t). These signals are then interpolated, filtered, modulated, and 


COS wrt 





SIN wet 





SIN @at 


Fig. 5—Implementation of complex demodulation for low-pass translation with 
interleaving of samples of an and bp. 
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summed as shown in Fig. 5 to give the reconstructed sub-band signal 
Fn(t). 

For digital implementation h, can be realized with a digital filter. 
Decimation, or sampling-rate reduction by an integer factor M, can be 
achieved by retaining only one out of every M samples of the output 
of the filter. The filter is necessary to avoid aliasing. Interpolation by 
an integer factor M is accomplished by increasing the sampling rate 
by filling in M — 1 zero-valued samples between each pair of input 
samples. The filter h, then removes the unwanted harmonic images of 
the base-band signal and smooths (i.e., interpolates) these samples to 
appropriate values of the base-band waveform. Efficient methods for 
implementing digital decimators and interpolators are discussed in 
Ref. (6). 


Ill. ENCODING OF THE SUB-BAND SIGNALS 


Digital encoding of the low-pass translated signal r,(t) is best 
accomplished using adaptive-Ppom (APcM).7? ApcM encoding is preferred 
over adaptive-differential pcm (appcm) methods in this case due to the 
low sample-to-sample correlation of the low-pass-translated, Nyquist- 
rated, sampled signals. 

For computer simulations, apcm coders based on a one-word step- 
size memory were used according to methods proposed by Jayant, 
Flanagan, and Cummiskey.’~® Step-size adaption is achieved accord- 
ing to the relation 


A; => Ay-1 x M, (6) 


where A, is the quantizer step-size used for the rth sample and A,-, is 
the step-size of the (r — 1)th sample. M is a multiplication factor 
whose value depends on the quantizer level at the (r — 1)th sample. 
For example, in a two-bit quantizer, two magnitude levels and the 
sign can be represented. If the smaller magnitude level is used at 
time 7 — 1, M is chosen to have a value M = MM, < 1, and if the larger 
magnitude level is chosen, M = M,> 1 is used. For a three-bit 
quantizer, four magnitude levels and the sign can be represented. In 
this case, there are four choices for M. Through simulations, appro- 
priate values of M for a two-bit quantizer were found to be M71 = 0.845 
and M, = 1.96. For a three-bit quantizer, they are M, = 0.845, 
Mz = 1.0, M3 = 1.0, and M, = 1.4. Note that the three-bit quantizer 
does not change its step-size at time r unless the largest or smallest 
quantizer level is encountered at time r — 1. The above values of M 
are in approximate agreement with values proposed by Jayant’ for 
full-band apcm encoding. 
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0 1 2 3 4 
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Fig. 6—Partitioning of the speech spectrum into four noncontiguous bands to 
achieve reduced bit-rate coding. 


IV. SUB-BAND CODING FOR TRANSMISSION AT DATA RATES 


The transmission bit rate of the sub-band coder can be reduced into 
the range of conventional data speeds by further limiting the sub- 
bands in width and tolerating some spectral gaps as shown in Fig. 6. 
Carried to excess, the noncontiguous bands produce a reverberant 
quality in the signal, such as one gets from comb filtering. In moder- 
ation, however, some highly useful compromises can be achieved 
between transmission bit rate and quality. The coded bands still 
cover a respectable range of the speech spectrum, and provide a 
quality considerably better than coding a single full-band signal. 


V. INTEGER-BAND SAMPLING AND HARDWARE CONSIDERATIONS 


Another attractive alternate implementation of these ideas is to use 
“integer-band” sampling to code a signal that is aliased in an advan- 
tageous way. The technique is illustrated in Fig. 7. 

The signal sub-bands s,(¢) are chosen to have a lower cutoff fre- 
quency of mf, and an upper cutoff frequency of (m + 1)f,, where m 
is an integer and f, is the bandwidth of the nth band. This bandpassed 
signal is sampled at 2f, to produce the sampled spectrum shown in 
Fig. 7 (for m = 2). The received signal is recovered by decoding and 
bandpassing to the original signal band. Typically, values of m from 
1 to 3 are most useful for coder applications with lower bands using 
values of m = 1 and upper bands using m = 2 or m = 38. This integer- 
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band sampling technique achieves the theoretical maximum efficiency 
in sampling.” 

A very attractive advantage of the integer-band sampling approach 
is that it does not require the use of modulators. A slight disadvantage 
is that the above restrictions prevent the choice of bands strictly on 
the basis of equal contribution to a1. However, little loss in performance 
is observed if this equal contribution to ar condition is only approxi- 
mate (within a factor of 2). This implementation was used for per- 
ceptual comparisons, which will be discussed later. 

This approach is especially attractive for implementing the bandpass 
filters as charge-coupled-device (ccp) transversal filters. The analog 
to discrete-time conversion is inherently accomplished by the ccp 
filter with little or no analog prefiltering or post filtering required for 
the prevention of aliasing. The initial signal sampling can be con- 
veniently high, say 15 kHz, to realize the ccp filter, and the filter 
output can be decimated to the 2f, rate for coding. After transmission 
and decoding, the 2f, rate can be interpolated to the 15-kHz rate for 
the final bandpass filtering, again by the analog ccnp filter. 

Another advantage of ccp filters (and also digital filters) is that 
the filter cutoff frequencies are inherently normalized to the initial 
sampling frequency. Therefore, the sampling frequency and, conse- 
quently, the bit rate of the coder, can be varied over a limited range by 


S,(t) ry (t) = 7 
vA Y ENCODER DECODER 


mf TO (m+ 1)f mf TO (m+ 1)f 
SAMPLE 
AT 2f 


AMPLITUDE SPECTRA 


_, dh | | tm saci oraeate 


—4f —3f —2f —1f 0 If 2f 3f 4f (m = 2) 
i ee! (eh ne (eR ce SAMPLE AT 2f 
—4f —2f 0 2f 4f 
Se / Oe Ae), eae SAMPLED SIGNAL 
—4f —2¢ 0 2f 4f 
th lm DESAMPLED SIGNAL 
—3f —2f 0 2f 3f 


Fig. 7—Integer-band sampling technique for digital encoding of speech sub-bands. 
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Table |— Frequencies and sampling rates for the 16-kb/s coder 


Tn (t) 


Sub-band pak 4 Decimation | Quantization 
Center F s 1 , 
No. on (Hz) sbi Rate (6-4) (From 10 kHz) (Bits) 
1 448 1250 16 3 
2 967 1429 14 3 
3 1591 1667 12 2 
4 2482 2500 8 2 


varying the master clock frequency. This cannot be achieved with 
analog filters. 

Present technology is able to provide four 100-tap ccp transversal 
filters on a single integrated-circuit chip or one 200-tap filter on a 
chip with all necessary drivers and control logic. 


Vi. COMPUTER SIMULATIONS OF SUB-BAND ENCODERS 


The sub-band coder has been implemented by computer simulation 
for transmission bit rates of approximately 16 kb/s and 9.6 kb/s. The 
complex demodulation approach in Fig. 5 was used for low-pass 
translation of the bands. An initial sampling rate of 10 kHz was 
employed in both cases. 

The 16-kb/s coder was implemented with the band center frequencies 
and sub-band sampling rates shown in Table I. Bandwidths are equal 
to one half of the sampling rates and correspond to those shown in 
Fig. 1. Three-bit coders were used in the two lower bands, and two-bit 
coders were used for the upper bands. The filters were 125-tap FIR 
filters. As can be observed in Fig. 1, the filters overlap in their transi- 
tion bands and give an overall flat frequency response from 200 Hz 
to 3100 Hz. 

The 9.6-kb/s coder was implemented with the bands given in Table 
II and illustrated in Fig. 6. In this case gaps were allowed between 
bands. Larger filter orders, 175-tap (FIR), were used to reduce transition 
bands and conserve bandwidth. Only the lower band used a three-bit 
coder. Upper bands used 2-bit coders. 


Table Il — Frequencies and sampling rates for the 9.6-kb/s coder 
Sub-band ri r(t) Decimation Quantization 
F S 1 : 
No. Cont req | pap aes, | (From 10 kHz) (Bits) 

1 448 800 25 3 
2 967 952 21 2 
3 1591 1111 18 2 
4 2482 1538 13 2 
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Illustrations of the signal coded for 16 kb/s and 9.6 kb/s by the 
above-band-translation technique are given by the spectrograms of 
Figs. 8 and 9, respectively. In each figure, the upper spectrogram 
corresponds to the original sentence. The middle spectrogram corre- 
sponds to the signal played through the filters, decimators, and interpo- 
lators—but without coders. The bottom spectrogram illustrates the 
sub-band encoded speech at the designated bit rate. 

Other simulations have also been made for encoding the signals 
a,(t) and b,(¢) directly and also for encoding the magnitude and phase 
derivative (as in the phase vocoder). Similar quality results were found 
in these simulations. 


Vil. SUBJECTIVE COMPARISONS WITH OTHER ENCODING METHODS 


Informal listening tests were made to compare the quality of the 
sub-band coder simulations with that of full-band encoding. For the 
16-kb/s coder, comparisons were made with 2- and 3-bit appcm. For 
the 9.6-kb/s coder, comparisons were made with adaptive delta 
modulation (ADM) (i.e., 1-bit appcm). Results for the 16-kb/s coder 
comparisons are given in Table III. 

Twelve listeners were asked to compare pairs of sentences for signal 
quality and indicate which was better. Two speakers were used in the 
experiments and sentence pairs were played in a randomly selected 
order. Each listener made a total of 16 comparisons in each of the 
experiments. 

In comparing 16-kb/s sub-band encoding to 16-kb/s (2 bits/sample) 
ADPCM, listeners rated the sub-band encoded sentence as having higher 
quality in 94 percent of the sentence pairs. When the bit rate of the 
ADPcm coder was increased to 24 kb/s (3 bits/sample), they rated the 
sub-band encoded sentence as having higher quality in 34 percent 
of the sentence pairs. Experiment I demonstrates that the quality of 
the 16-kb/s sub-band coder is clearly preferred over that of AaDPcM 
at the same bit rate. In Experiment II listeners exhibited much 
greater indecision, indicating that the quality of the 16-kb/s sub-band 
coder is close to that of 24-kb/s appcm, but that preference leans 
slightly in favor of the appcm. 

Also included in Table III are signal-to-quantizing-noise ratios 
(s/n) measured on the speech signals, averaged for the two speakers 
for each of the coding methods. s/n data is not found to be a reliable 
indicator of listener preference. This observation is not surprising 
and has been previously recognized in the speech coding literature.”"® 

A second series of listening experiments compared 9.6-kb/s sub-band 
coding with apm. The sub-band encoder in this case is implemented 
with the integer-band method described earlier. The apm coder is a 


DIGITALLY CODED SPEECH 1079 


1080 





“ UNQUANTIZED 


=I 

x 

2 2 

Oo z 

« © 

Oo a 
j i 





ZH» NI AONANDAYS 


THE BELL SYSTEM TECHNICAL JOURNAL, OCTOBER 1976 


2.0 


0.8 


0.6 


4 


0. 


0.2 


TIME IN SECONDS 


unquantized speech output and the 16-kb/s output from the sub-band coder. 


? 


Fig. 8—Sound spectrograms comparing original 





~ ORIGINAL 





“ UNQUANTIZED 
2.0 


8 


1 


6 


1 


4 


1 


0.6 


0.2 


ZHA NI AONANDAYS 


DIGITALLY CODED SPEECH 


TIME IN SECONDS 


-~band coder. 


kb/s output from the sub 


unquantized speech output and the 9.6- 


? 


Fig. 9—Sound spectrograms comparing original 


1081 


Table II|— Comparison of 16-kb/s sub-band coder with ADPCM 
Experiment 1: 16-kb/s Sub-band vs 16-kb/s appcm 


Listener S/N 

Preference (%) (dB) 

16-kb/s Sub-band 94 11.1 
16-kb/s aprcm (2 Bits) 6 10.9 


Experiment 2: 16-kb/s Sub-band vs 24-kb/s appcm 


Listener S/N 

Preference (%) (dB) 

16-kb/s Sub-band 34 11.1 
24-kb/s ADPCM 66 14.5 


forward step-size transmitting coder shown by Jayant to have 
improved performance over conventional apm. Table IV shows the 
results of these experiments. Three different bit rates, 10.3, 12.9, and 
17.2 kb/s, were used for the apm coder. In the first two experiments, 
the 9.6-kb/s sub-band coder was clearly preferred. In the third experi- 
ment, there was greater indecision with preference leaning slightly in 
favor of the sub-band coder. Note that this is true despite the opposite 
ordering of the s/n values! In other words, the perceptual palatability 
is not well reflected in the s/ns as has been observed previously.® 


Table 1V— Comparison of 9.6-kb/s sub-band coder with ADM 
Experiment 1: 9.6-kb/s Sub-band vs 10.2-kb/s apM 


Listener S/N 

Preference (%) (dB) 

9.6-kb/s Sub-band 96 9.9 
10.3-kb/s apm 4 8.2 


Experiment 2: 9.6-kb/s Sub-band vs 12.9-kb/s apm 


Listener S/N 

Preference (%) (dB) 

9.6-kb/s Sub-band 82 9.9 
12.9-kb/s ADM 18 9.7 


Experiment 3: 9.6-kb/s Sub-band vs 17.2-kb/s apM 


Listener S/N 

Preference (%) (dB) 

9.6-kb/s Sub-band 61 9.9 
17.2-kb/s apM 39 11 
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LISTENER PREFERENCE IN PERCENT 
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Fig. 10—(a) Relative comparison of quality of 16-kb/s sub-band coding against 
ADPCM coding (based on listener preference) for different appcm coder bit rates. 
(b) Relative comparison of quality of 9.6-kb/s sub-band coding against apm coding 
for different apm coder bit rates. 


Figure 10 summarizes the results of the listener preference tests in 
Tables III and IV. Listener preference is plotted against the appcm 
and ADM coder bit rates. The crossover points of the curves in the two 
comparisons determine the point at which the two types of coders 
have approximately equal subjective quality. In the first comparison, 
the quality of the 16-kb/s sub-band coder is seen to be comparable 
to that of 22-kb/s appcm; 1.e., it has a 6-kb/s advantage over the 
AppPcm coder. In the second comparison, the 9.6-kb/s coder has a 
subjective quality that is comparable to the 19-kb/s apm and, there- 
fore, has a 9.4-kb/s advantage over ADM. 
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It is clear from the listener preference tests that the sub-band 
coding technique is considerably better in quality than full-band 
ADPCM or ADM coding methods. We have carried this coding down to 
7.2 kb/s and find that the quality is only slightly poorer than that at 
9.6 kb/s. We have also pressed the coding rate down to 4.8 kb/s and 
find that the quality becomes considerably poorer owing to the in- 
creased band limiting and gaps between bands. 


Vill. CONCLUSION 


We have described a method for digitally coding speech in sub-bands 
of the total signal spectrum. Partitioning into sub-bands has several 
distinct advantages. Bit allocations for quantization of each band can 
be made on a perceptually palatable basis. Quantization products in a 
given band are confined to that band and do not “spill over’ into 
adjacent frequency ranges. Selection of sub-band widths can also be 
made according to perceptual criteria, namely, for equal contributions 
to AI (and hence to signal intelligibility). As a result, the sub-band 
coding produces a quality signal that is better than a single full-band 
coding at the same total bit rate. The price paid is the band-filtering 
and the individual coding. 

“Integer-band”’ sampling is demonstrated to be an economical 
and effective method for implementing the sub-band coder. Emerging 
technologies in device fabrication (such as ccps) suggest economical 
implementations of the band filtering in terms of analog transversal 
filters. 

The sub-band coder, implemented by integer band sampling, is 
demonstrated for speech transmission at rates of 16, 9.6, and 7.2 
kb/s. The latter two transmission rates push down into the data 
range and are attractive for ‘‘voice-coordination” over data channels. 

Informal perceptual experiments demonstrate that the signal 
quality of speech coded at 9.6 kb/s by the sub-band method is approxi- 
mately equivalent to a 19-kb/s coding of the full-band signal. For a 
given transmission bit rate, therefore, the sub-band technique provides 
a significant improvement in signal quality. Or alternatively, for a 
given signal quality, the sub-band system can provide the transmission 
at a significantly reduced bit rate. 
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Subjective Evaluation of PCM Coded Speech 
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Subjective quality ratings of PCM coded speech were obtained with the 
aims of (1) determining the effects of certain coder parameters and their 
interactions on speech quality, (12) finding objective measures for predicting 
perceived distortions, and (111) providing guidelines for optimizing coder 
design. Coders with various combinations of four clipping levels, seven 
step sizes, four bandwidths, and three logarithmic companding laws were 
simulated. The coders were rated for quality on a 10-point scale by 48 
listeners who heard male and female speech processed by the coders. 

The ratings depended strongly on clipping level and step size, but only 
weakly on bandwidth. None of the coder parameters interacted strongly 
with another. Clipping noise power grossly overestimated the extent of 
perceived overload distortion; instead, clipping percentage 1s proposed as 
a much more realistic predictor. Signal-to-granular-noise ratio was a 
good predictor of perceived granular noise. For a given bit rate, the coder 
with the highest quality rating was not the coder with minimum total 
clipping and granular noise power, contrary to traditional wisdom. 


l. INTRODUCTION 


“How does it sound?” This is a fundamental but elusive question 
for the engineer designing or evaluating a system for transmitting, 
recording, or processing speech signals. If the system is analog, the 
engineer has as a guide a substantial body of information about the 
interrelated effects on speech quality of such factors as attenuation, 
noise, linear and nonlinear distortion, echo, and cross-talk.1 With 
respect to digital systems, however, the subjective effects of charac- 
teristic distortions have been documented to a much smaller extent 
and, as a consequence, the quality of an existing system and the 
merits of proposed designs are much harder to predict. 

One approach to the evaluaton of digital systems is to relate a 
digital signal distortion to one of the analog distortions, and to define 
digital speech quality as the subjective correlate of the equivalent 
analog distortion.? Although expedient and reasonably accurate for 
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certain individual distortions, the value of this approach seems quite 
limited in the important situations where several distortions occur 
simultaneously. 

While the engineering literature contains many reports of subjective 
tests of digitally coded speech, most of the tests were undertaken to 
provide performance data on the overall distortions produced by 
specific coders. Among the exceptions to this approach and more 
aligned with the spirit of our work are the experiments reported by 
Donaldson and Chan,’ O’Neal and Stroh,‘ and Yan and Donaldson? in 
which individual sources of distortion were identified and the manner 
of their interaction investigated. In these studies, the effects of band- 
width, predictor network, number of bits per sample and transmission 
error rate in pcm (pulse code modulation) and differential pcm systems 
were studied. Another design variable, quantizer overload point, was 
held fixed although Ref. 5 ends with the suggestion, ‘‘A careful study 
of the dependence of subjective quality on... [overload point]... 
seems necessary.’’ Our experiment contains a thorough study of the 
role of this parameter in PCM. 


Il. AN OVERVIEW OF THE EXPERIMENT 


We used a digital computer to process speech with 208 different 
pcM coding schemes whose characteristics span an important range 
of bandwidths, number of bits per sample, overload levels, and com- 
pression characteristics. Our aims included the study of: (2) the in- 
fluence on speech quality of the above design parameters, (77) objective 
measurements that are good predictors of speech quality, and (222) 
optimum combinations of code parameters. 

In the experiment, 48 listeners used a 10-point opinion scale to 
provide quality ratings of speech processed by each of the coders. 
The speech material consisted of 10 sentences, each spoken by two 
females and two males. Our principal conclusions from the analyses 
of the data are: 


(z) Overload level and quantizing step size were primary deter- 
miners of listeners’ ratings. Bandwidth was, by comparison, a 
secondary determiner of speech quality. 

(iz) The traditional objective measurement, overall signal-to-noise 
ratio, was not a useful predictor of speech quality. On the other 
hand, the percent of samples clipped, P and the signal-to-noise 
ratio, Q of the granular quantizing noise were useful and 
independent predictors of speech quality. A simple linear 
equation 

kR=aP+bQ+¢, 
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Fig. 1—Block diagram of major steps in the experiment. 


where a, b, and c are empirically derived constants, was a 
good predictor of the quality rating R. 

(<iz) For a fixed number of bits per sample, the coder with the 
highest quality rating was not the coder with the highest 
signal-to-noise ratio. 


The experiment involved three major steps, as shown in Fig. 1. 
The first step was to compile a source speech library consisting of high- 
quality recordings of sentences. The second step was to simulate a 
variety of coders and noise processes on a computer. The final step was 
to process the source speech with the simulated coders and noise 
processes in accordance with an overall experimental design and to 
obtain subjective quality ratings from listeners. 


lil. SOURCE-SPEECH LIBRARY 


Digital recordings were made of the ten phonetically balanced 
sentences listed in Table I as spoken by two females and two males. 
The talkers were seated in a sound-proof booth and spoke into a Sony 
ECM 22p microphone. The amplified microphone signal was low-pass 
filtered at 9.6 kHz, sampled 24,000 times per second, uniformly 
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quantized to 12 bits per sample, and written onto digital tape. Each 
sample was represented by an integer between — 2047 and +2048. For 
each talker, the quantizer step size was adjusted manually to use the 
full quantizer range without clipping. Once a step size was established 
for a talker, the same step size was used for all ten sentences. This 
procedure approximately equalized the peak power level of the four 
talkers over all sentences. The source-speech library thus consisted of 
digital recordings of 40 sentences containing all the speech sounds 
spoken by four talkers and approximately equalized for peak power 
over talkers. 


IV. SIMULATION OF CODERS AND NOISE PROCESSES 


A pcm system contains a low-pass presampling filter of bandwidth 
W, a sampler that generates 2W equally spaced signal samples per 
second, a quantizer operating independently on each sample, and a 
low-pass desampling filter of bandwidth W which generates a con- 
tinuous waveform from the quantized sequence. In the experiment, 
each of these components—presampling filter, quantizer, desampling 
filter—was simulated on a DDP-224 digital computer. Within the 
computer, “analog speech’? appeared in the 24,000-samples/second, 
12-bits/sample format of the recording scheme, while sampled and 
quantized speech appeared with fewer bits and fewer samples. 


4.1 Bandlimiting and sampling 


The four sampling rates used in the experiment were all integer 
submultiples of 24 kilosamples/second: 12, 8, 6, and 4.8 kilosamples/ 
second and the nominal cutoff frequencies of the associated low-pass 
filters were 6, 4, 8, and 2.4 kHz, respectively. The filters, all realized 
as finite impulse-response digital filters with integer coefficients, were 
designed to meet the requirements listed in Table II, which conform to 


Table |— The ten sentences spoken by each of four talkers * 


. A lathe is a big tool. 

. Grab every dish of sugar. 

An icy wind raked the beach. 
Her father failed many tests. 

. Joe brought a young girl. 

The chairman cast three votes. 

. The boy was mute about his task. 
. Beige woodwork never clashes. 

. Both teams started from zero. 

. My cap is off for the judge. 


— 


* Each is a simple declarative sentence that can be spoken in approximately two 
seconds. The list includes all the phonemes of English in initial, final, and intervocalic 
position. 
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Table Il — Bandlimiting filter specifications 


Sampling rate (kilosamples/s) 12 8 6 4.8 
Nominal cutoff W kHz (Attenuation at 

least 15 dB at f = W) 6 4 3 2.4 
Passband edge attenuation within 

+0.125 dB 4.5 3 2.25 1.8 
Stopband edge attenuation at least 30 dB 7.125 4.75 3.562 2.85 
Filter order 21 33 41 51 
Oversampling ratio 2 3 4 5 


the requirements imposed on channel banks of digital multiplex 
systems. 


4.2 Interpolation 

The digital interpolating filter simulates the desampling filter of a 
pom coder. The latter transforms a sampled signal to a continuous 
waveform. In the computer, “continuous waveforms” appear as 
samples occurring at the rate of 24,000 per second; to produce them, 
an interpolating filter inserts 1, 2, 3, or 4 new samples between each 
pair of pcm samples, depending on whether the sampling rate of the 
simulated coder is 12, 8, 6, or 4.8 kHz, respectively. 


OUTPUT 


INPUT 


Fig. 2—Input/output diagram of a quantizer. 
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While, in practice, specifications of desampling filters are often 
identical to those of presampling filters, we found for our purposes 
that a 30-dB stopband attenuation was insufficient at certain fre- 
quencies. Some speech sounds, particularly nasals, with strong low- 
frequency components produced audible output tones in the vicinity of 
the sampling frequency in desampling. For example, the spectrum of 
sound with considerable energy around 200 Hz has images at 5800 Hz 
and 6200 Hz when sampled 6000 times per second. Even attenuated 
40 dB, these images produced an audible ‘‘whistle,’”’? which was very 
distracting to listeners. In the design of interpolating filters, therefore, 
we specified an attenuation of at least 65 dB near the sampling 
frequency. 


4.3 Quantization 

A quantizer is defined by an input/output diagram such as Fig. 2. 
To study the subjective effects of quantization, it is appropriate to 
formulate this operation as a sequence of four processes as in Fig. 3: 
clipping, compression, uniform quantization, and expansion. 

Clipping is an inherent part of the quantizing operation. Figure 2 


CLIPPING COMPRESSION 


zZ 





UNIFORM 
QUANTIZING EXPANSION 


q 





Fig. 3—Four processes included in quantization. 
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shows that the largest magnitude that can be represented by the 
quantizer is A. All samples greater than A result in the same output, 
as do all samples less than —A. Hence, the quantizer operates as a 
device that first clips the input and then represents with finite reso- 
lution all signal samples in the range —A to A. 

Compression and expansion (each the inverse of the other) are 
nonlinear transformations of the uniform quantizer’s analog input 
samples and quantized output samples, respectively. Current trends 
in communication technology favor the use of segment-compression 
characteristics, which are piecewise linear approximations to loga- 
rithmic input/output relationships. In the experiment, we simulated 
segmented y-law quantizers* in which the length of each linear segment 
is double, and the slope one-half, that of the previous segment.* The 
compression curve in Fig. 3 contains five segments. There are three seg- 
ments for positive inputs and three for negative inputs, with the inner- 
most positive and negative segments colinear. In the input/output 
characteristic, the quantization step size is constant over a segment and 
double that of the previous segment. Hence, high-level samples are 
quantized more coarsely than low-level samples. 

In practice, the number of positive (or negative) segments is a 
power of 2 so that the total number of distinct segments can be written 
as 2@+1) — 1, In the experiment, we studied quantizers with c = 0 
(uniform quantization), c = 2, and c = 3, which are 1, 7, and 15 
segment quantizers with parameter » = 0, 15, and 255, respectively. 

We describe the uniform quantizer in Fig. 3 by its step size S which 
is equal to the minimum step size of the nonuniform quantizer of Fig. 2. 

The entire quantizer is now defined by three parameters: the over- 
load level A, the companding number uz, and the step size S. For 
engineering purposes, the most important quantizer parameter is the 
number of bits per sample B. Table III shows the dependence of B 
on A, », and S over the range of parameters appearing in the experi- 
ment. While, in engineering studies, quantizers are usually specified by 
nu, B, and S or by yu, B, and A, the design and analysis of experiments 
such as this one are greatly facilitated by viewing uv, A, and S as the 
independent variables of a quantizer. The advantages of this point of 
view derive from the fact that quality varies monotonically with both 
A and S. The relationship of quality to B is considerably more com- 
plicated (see Section VIII) and is more readily derived as a combination 
of two relatively simple functions than measured directly. 

Because the source speech appears in the computer encoded in 


* The compressor characteristics are piecewise linear approximations to 


z = sgn(y)Llog (1 + 2 |y|) J/log (1 + x). 
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Table IIl— Number of bits as a function of step size 
and clipping level 


w=0 
Clipping level 
2048 1024 512 206 128 64 32 


1 12" 11 10 g* st CG 
2 11" 10 9 8* 7 6 +5 
4 10* 9 8 7” 6 Bs 
8 9* 8 7 6* ne 
2 * 6 4* * * 
Step size 64 6 5 4 3* Q* 1" 
128 5 4 3 2 1 
256 4 3 2 1 
512 3 2 1 
1024 2 1 
2048 1 
is 5 


Clipping level 
1920 960 480 240 120 60 30 


1 10* 9 8 ‘i 6* 5* 4 
2 9* 8 7 6* 5* 4* 3 
72 2 aE Se? 
7, * 5 4* * 
Step §1ze 16 6* 5 4 3* 
32 5* 4 3 
64 4* 3 
128 3 


pw = 255 
Clipping level 
2040 1020 510 255 


1 on ae 
; 6* 
Step size 4 5 4 

8 4* 


* Indicates quantizers used in experiment. 


steps of 1 from —2047 to 2048, A cannot exceed 2048 and S cannot 
be less than 1. Hence, for each c, there is an upper limit on the number 
of bits per sample that can be simulated. The limit is 12 bits for c = 0, 
10 bits for c = 2, and 7 bits for c = 3. Conversely, there is a lower 
limit on B because there must be at least one output level for each 
positive segment and one for each negative segment in the compression 
curve. This implies that the c = 2 quantizer must have at least 3 
bits/sample and the c = 3 quantizer at least 4 bits. 

After a pilot experiment, we decided to vary S in octave steps. 
Table III shows for each companding law the values of S, A, and B 
that can be simulated by our procedure. An asterisk indicates a 
quantizer used in the experiment. The first column of each matrix 
contains quantizers with no clipping. We omitted quantizers in the 
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second and third column because the pilot study suggested that the 
deterioration in quality associated with the transition from the first 
to the fourth column was approximately the same as the deteriorations 
in the transition from fourth to fifth, and fifth to sixth columns. 


4.4 Noise processes 


In addition to speech degraded by the pcm coding process, the ex- 
periment included speech distorted by two types of additive noise. 
There were four levels of simulated white gaussian noise added to 
speech samples; the noise levels were chosen to provide signal-to-noise 
ratios around 30, 20, 10, and 0 dB. It was felt that gaussian noise is 
similar in character to the granular quantizing noise of a uniform 
quantizer (c = 0). 

In addition, we included four levels of speech-dependent noise.’ To 
each sample x7, was added -+-pxn, where p is the noise-to-signal ratio 
and the + or — sign is determined by a simulated coin-toss. Thus, 
the noise magnitude added to each sample is proportional to the 
magnitude of the sample. This type of distortion is similar in character 
to the quantizing noise of a companded quantizer in which the noise 
magnitude increases in a probabilistic sense with signal magnitude. 
The four speech-dependent noise levels provide the signal-to-noise 
ratios 30, 20, 10, and 0 dB, where s/n = 20 log (1/p) dB. 


V. SUBJECTIVE EVALUATION OF TRANSMITTED SPEECH 


After all simulations were completed, the source speech was pro- 
cessed by the coders and noise processes and the processed speech 
written onto digital tape to form a transmitted speech library, as 
shown in Fig. 1. When the library was complete, the transmitted 
speech was converted from digital to analog and recorded onto audio 
tape for subjective evaluation. 

Four analog tapes were prepared, each containing one example of 
each of the 240 experimental conditions: (52 coders + 8 noise con- 
ditions) X 4 bandwidths. The assignment of talkers to conditions 
followed a latin square design in a bandwidth by tape-number matrix. 
Thus, for a given coder or noise condition, a different talker was 
associated with each of the bandwidths on a single tape. Over the four 
tapes all 16 talker-bandwidth combinations appeared with each coder 
and noise condition. For a given bandwidth, each sentence occurred 
6 times and each talker 15 times over the 60 noise and coder conditions. 

The 240 conditions on each tape were presented in random order 
to 48 students at a local university, who listened to the stimuli on 
TDH-39 earphones. Twelve subjects judged the stimuli of each tape. 
They were asked to “rate each sentence on a scale of 1 to 10 according 
to its acceptability as a communication link, using 1 to represent the 
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least acceptable, 10 the most acceptable, and the other numbers 
between 1 and 10 for intermediate ratings.”’ Before the test began, 20 
representative conditions were presented to familiarize the listeners 
with the range of speech quality. 


VI. OBJECTIVE MEASURES OF SPEECH QUALITY 


Because signal-to-noise ratio (s/n) is the most frequently cited 
measure of speech quality, the relationship of s/n to subjective 
appraisal of processed speech is a matter of substantial interest in an 
experiment such as ours. A very strong inference of our data is that a 
single s/n statistic—the ratio of the power in the original speech to 
the power in the difference between processed speech and original 
speech—is a poor predictor of subjective quality. Instead, we find 
that the effects of clipping and granular quantization must be con- 
sidered separately if we are to arrive at a correct prediction of per- 
ceived quality. Therefore, we define two noise components: NC, the 
clipping noise, defined as y — «x in Fig. 3, and the granular quantizing 
noise NG, defined as gq — y. The total quantizing noise is 


NQ=q-—x=NC+NG. 


To facilitate measurement of these and other quantities for each 
quantizer, we produced a digital data tape which, for each utterance 
passed through each presampling filter, recorded the number of times 
each possible sample amplitude (from — 2047 to 2048) occurred. We 
used this tape to calculate the power in each filtered utterance, the 
mean square values of NG, NC, and NQ for each quantizer, and addi- 
tionally, the percentage of samples clipped by each quantizer. This 
last statistic, P, proved a better correlate of listener opinions than the 
mean square value of NC. 


Vil. RESULTS 
7.1 Overview of data analyses 

Statistical procedures were applied to evaluate the relative influence 
of each of the experimental variables on the listeners’ ratings. Analyses 
of variance showed that two variables, clipping level A and step size S, 
were the major sources of variability influencing the ratings. Multiple 
regression procedures provided linear estimates of ratings as functions 
of two objective distortion measures, one related to A, the other to S. 


7.2 Determiners of speech quality 
7.2.1 Listeners and tapes 


An analysis of variance was computed to study the variability of 
the ratings of the 12 listeners who judged a single tape, and the vari- 
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ability among the four listener groups, each of which judged a different 
tape. The analysis showed that the listeners within each group were 
not significantly different in their ratings and that the ratings among 
the four groups of listeners were not significantly different. In each 
case, the F ratios were less than 1.0. Therefore, the mean of the 
listeners’ ratings for each condition was used for all further analyses. 


7.2.2 Coder parameters 


A second analysis of variance was computed, using the means of 
the listeners’ ratings, to study the effects of the experimental variables. 
In this analysis, the differences in the ratings due to step size, clipping, 
and companding were statistically significant, as expected, and in 
combination accounted for 84 percent of the total variance. While the 
variability in the ratings due to the different talkers, the different 
bandwidths, and their interaction were all statistically significant, each 
of these effects accounted for only 2 to 3 percent of the total variance. 


7.2.3 Talkers 


Figure 4 shows the mean rating across clipping and step size as a 
function of bandwidth for each talker at the three companding values 
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Fig. 4—Mean rating across clipping level and step size as a function of bandwidth 
for cach talker and each companding law. 
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Table IV — Mean ratings across talkers and listeners 


Companding p=0 w= 15 p = 255 
Clipping Level 2048 256 128 64 1920 240 120 60 2040 255 
1} 9.2 88 7.0 4.7 9.1 7.6 6.9 4.7 9.2 5.0 
2} 92 85 63 4.4 91 74 53 3.4 7.8 
6 kHz 4) 9.1 83 6.1 4.1 8.3 6.2 5.2 3.3 6.5 
Band-| Stepsize 8} 8.1 7.1 5.8 4.0 8.7 4.8 3.8 4.6 
width 16] 8.1 63 4.8 3.8 7.6 3.9 
382; 6.3 5.1 40 2.4 5.9 
64; 5.0 3.8 3.2 1.8 4.3 
1/ 90 7.6 6.3 3.6 8.7 7.2 53 3.8 7.7 48 
2} 83 76 64 4.2 8.6 6.7 4.7 3.8 7.3 
4 kHz 4} 83 6.8 5.7 4.2 8.2 63 5.1 3.2 6.0 
Band-| Step size 8| 7.2 6.2 4.9 4.3 7.5 5.3 3.7 5.2 
width 16| 6.7 5.7 4.7 3.6 6.9 3.9 
32; 5.9 48 3.6 2.5 5.6 
64; 5.3 386 3.1 1.2 4.7 
1) 7.9 6.7 5.3 3.9 8.1 64 5.1 3.6 8.2 4.5 
2| 7.9 66 5.2 3.7 8.7 6.2 43 3.4 6.9 
3 kHz 4/ 72 64 4.5 4.1 7.8 63 44 2.7 5.7 
Band-| Stepsize 8/ 6.6 5.6 5.3 3.3 6.6 4.6 3.3 5.0 
width 16) 58 49 44 2.7 5.7 4.1 
32|} 4.7 3.9 36 2.1 4.5 
64) 3.8 2.9 26 1.3 3.6 
1| 9.4 7.7 6.0 4.7 8.9 8.0 55 3.9 8.4 4.6 
2/ 9.1 69 62 4.4 8.4 7.2 5.1 4.0 7.5 
2.4 kHz 4} 82 7.3 6.1 4.3 8.1 6.1 4.1 3.3 6.0 
Band-| Step size 8| 8.0 6.7 5.6 4.2 8.1 5.7 3.5 5.2 
width 16; 7.1 5.7 5.1 3.2 6.7 4.3 
32| 56 4.5 4.2 3.0 5.8 
64} 49 44 28 1.38 5.0 


uw = 0, 15, 255. Although there was some evidence that the coded 
speech of female talkers was rated somewhat lower than that of male 
talkers, the major source of the statistically significant differences 
among the talkers and the talker-bandwidth interaction was the 
consistently lower ratings assigned to one female voice. The mean 
power of her speech was approximately 3 dB greater than that of the 
other three talkers and the standard deviation of the power about 0.2 
dB less, making her speech more sensitive to clipping and filtering. 
Since the effect of the talkers was minimal, the mean rating across 
talkers was used for further analyses, thus reducing the variability 
in the data to that due to the influence of only the physical variables 
of the coders. The mean ratings across talkers and listeners are shown 


in Table IV. 
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7.2.4 Bandwidth 


Figure 4 also shows the effect of the four different bandwidths on the 
ratings. Although the ratings tended to decrease as the bandwidth 
narrowed, the differences between 6, 4, and 3 kHz were very small. 
Indeed, the source of the significant differences due to bandwidth was 
the much lower ratings that resulted from reducing the bandwidth from 
3 to 2.4 kHz. The ratings pertaining to three of the talkers contained 
no significant interactions between bandwidth and the other coder 
design variables. Only in the data for the female talker with the low 
ratings were these interactions statistically significant. The most 
salient of these interactions was between bandwidth and clipping. 


7.2.5 Clipping, step size, and companding 


Figure 5 shows the mean rating across listeners, talkers, and band- 
width at each step size as a function of clipping level, A, for each of 
the three companding conditions. The horizontal axes show A de- 
creasing (i.e., the amount of clipping increasing) from left to right. 
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Fig. 5—Mean rating across listeners, talkers, and bandwidth at each step size as 
a function of clipping level for each companding law. 
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Fig. 6—Mean rating across listeners, talkers, and bandwidth at each clipping level 
as a function of step size for each companding law. 


For each step size S the mean rating decreases as the amount of 
clipping increases. With » = 0 or uw = 15, an octave decrease in A 
results in a relatively small decline in rating when A > 256 and a 
relatively large decrease when A S 256. The data for » = 255, though 
limited, suggest that relative to the other compression laws, ratings 
are more sensitive to small amounts of clipping (A > 256). 

Figure 6 shows the same data points plotted as a function of step 
size at each clipping level. While, for each A, the ratings are inversely 
related to S, equal incremental differences in S tend to result in larger 
differences in the ratings as S increases. That is, the curves generally 
have a steeper slope when S > 8 for » = 0, and S > 4 for w = 15. 
The steeper slope of the curve for » = 255 suggests that quality may 
be influenced by an interaction between » and S. An analysis of the 
ratings of unclipped speech with S = 1, 2, 4, and 8 confirms this 
observation. While the ratings for » = 0 and » = 15 were not signifi- 
cantly different, those at » = 255 were significantly different from 
the ratings for the other two companding laws. 


7.3 Prediction of quality ratings 
7.3.1 Signal-to-noise ratio 


Figure 7 is a scatter plot of average rating vs measured s/n for the 
28 uniform quantizers and the 4 white-gaussian-noise processes. Here, 
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s/n is the usual engineering measure: the ratio of signal power to 
mean-square difference between quantizer input and output (a — q 
in Fig. 3). The s/n coordinate of a point in Fig. 7 is the average of the 
16 s/n’s of the individual utterances (4 bandwidths by 4 talkers) 
processed by a coder or noise condition. The most important feature 
of Fig. 7 is the horizontal clustering of the seven points associated with 
a given value of A, when A S 256. In all of these quantizers, the 
clipping noise, NC, substantially dominates the granular noise, NG, in 
the total noise, NC + NG. This dominance implies that s/n is virtually 
independent of S with A S 256, while, by contrast, perceived dis- 
tortion depends strongly on S, as evidenced by the vertical spread of 
the points pertaining to each A. Clearly, in the presence of clipping, 
s/n is a poor guide to ratings of speech quality: coders with the same 
s/n elicit widely divergent ratings. 


7.3.2 Noise references 


In Fig. 7, ratings and s/n are well correlated for one set of coders: 
those with no clipping, A = 2048. Here, the relationship of rating to 
s/n is similar to that observed for the gaussian noise processes. Figure 
7 suggests, therefore, that for uniform quantizing, white gaussian noise 
is a good noise reference when there is no clipping; it is a poor noise 
reference when clipping is significant. 
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Fig. 7—Relationship between mean rating and total s/n for the 28 linear conditions 
and the 4 gaussian-noise conditions. 
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Fig. 8—Mean ratings of unclipped speech for each companding law and the added 
noise conditions as a function of the total s/n. 


Because the amount of granular distortion produced by a coder 
with companding depends on signal amplitude, one may suppose that 
speech-dependent noise would provide a better noise reference than 
white gaussian noise for companded coders.?:? Figure 8 lends support 
to this conjecture by showing, for all of the coders with no clipping and 
all of the noise processes, the relationship of average rating to s/n. 
For » = 255, the relationship is similar to that observed with speech- 
dependent noise; for » = 0, it is similar to that observed with white 
gaussian noise. uw = 15 is an intermediate case. 


7.3.3 Regression analysis 

Because total s/n proved a poor correlate of listener ratings, we 
turned to multiple regression procedures to find an objective predictor 
of the ratings. The analyses of variance indicated that the ratings were 
primarily influenced by S and A, which have nearly independent 
effects.* Consequently, we used as independent variables of the 
regression one distortion measure related to S and one related to A. 
Appropriate measures proved to be Q, the granular s/n (ratio of signal 
power to power in NG) measured in dB, and P, the clipping proba- 
bility (the proportion of speech samples > A), expressed as a 
percentage. 


“In the analyses of the linear conditions, S accounted for 35 percent of the total 
variance, A for 45 percent, and the interaction of S and A for only 1 percent. 
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The regression was computed at each companding level with the 
original values for each talker at each bandwidth included as repeated 
observations. For example, each of the 28 coders with » = 0 was 
represented by 16 measurements (4 talkers X 4 bandwidths). Table V 
lists the coefficients obtained by the regression procedure at each 
companding level and also by combining the three companding levels. 
While the values of the coefficients change with u, the correlations and 
the rms values of the residuals do not change radically. The regression 
procedure was applied to the data for the coders with u = 255 for 
completeness, but the computation was based on information for only 
five coders at the smaller step sizes and only one clipping condition. 
When the ratings of the three companding laws were also included as 
repeated observations, the ratings predicted by appropriate weighting 
of only the Q and P correlate highly with the obtained ratings. 


Vill. DISCUSSION 
8.1 Effects of design variables 


Among the pcm design variables, system bandwidth W had the 
smallest effect on the ratings, a finding consistent with that of O’Neal 
and Stroh‘ who state that ‘‘over the range of 2.4—4.3 kHz changes in 
the bandwidth W of the speech signal are inconsequential in terms of 
the resultant user ratings.’ (To describe our data, we would substitute 
‘of minor importance” for ‘‘inconsequential.”) In considering the 
practical application of this conclusion, a caveat is necessary. In a 
recent experiment, Goodman, Goodman, and Chen? found that band- 
limiting, although less important than clipping and quantizing in 
determining listener ratings, had a very strong effect on consonant 
intelligibility. This suggests that the impact of band-limiting on the 
quality of communication may be more substantial than the results of 
rating tests imply. 

The significance of the dependence of ratings on the quantizer 
variables, » (compression law), S (step size), and A (clipping level), 


Table V — Coefficients obtained by regression level 
(R=aP+bQ+c) 


Corre- RMS 


a b Cc lations Residual 
zp =0 —0.08 0.09 3.87 0.87 1.005 
w= 15 —0.11 0.11 3.09 0.87 0.957 
p = 255 — 0.27 0.16 2.96 0.84 0.825 
Combined —0.10 0.09 3.99 0.85 1.038 


P = Percent clipped. 
Q = s/n granular quantizing noise. 
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(b) w= 15 
Fig. 9—Mean ratings at a constant number of bits as a function of clipping level 


for (a) wu = 0 and (b) w = 15. A circle indicates the highest-rated quantizer of a 
certain number of bits. 
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will be more apparent if we return to the usual engineering description 
of a quantizer which includes the number of bits per sample, B, as an 
independent variable. Thus, in Fig. 9, we have plotted the same points 
that appear in Fig. 5, but in this case, we have drawn lines showing 
contours of constant B rather than constant S. Here we see the effect 
on subjective ratings of the well-known compromise between clipping 
and quantizing in coder design. At the left of each curve, we have 
the quantizers that cause little or no overload but have high step sizes 
and, therefore, substantial granular quantizing noise. At the right, 
clipping distortion predominates over granular noise. 

Figure 10 demonstrates the effect of companding on ratings by 
displaying on the same graph rating vs clipping level curves for 5-bit 
and 6-bit encoders with » = 0 and » = 15. For a given clipping 
level, even this small amount of companding (practical values of u 
are 100 and 255) produces substantially higher ratings than those 
given the uniform quantizer. The companding advantage is well known 
and accounts for the presence of compandors in all pcm transmission 
systems. In terms of statistical signal theory, we may explain the 
advantage by saying that a nonuniform quantizer provides a better 
match to the probability distribution of speech amplitudes than a 
uniform quantizer. A perceptual explanation is that the low-level 
portions of a speech signal carry the most information. With A and B 
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Fig. 10—Mean rating as a function of clipping level for 5 and 6 bits at p = 0 
and » = 15, demonstrating the effect of companding. 
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given, a nonuniform quantizer codes low-amplitude samples with a 
smaller step size (and, therefore, lower NG) than the corresponding 
uniform quantizer. 


8.2 Objective measures of distortion 


Figure 11 shows, for coders with uniform quantizers, total s/n as a 
function of A. There are striking differences between these curves and 
Fig. 9. The most important differences are in the locations of the 
maximum points on corresponding curves and the substantially 
steeper slopes to the right of the maxima in Fig. 11. Both of these 
differences reflect the fact that NC increases very rapidly from zero as 
the clipping level decreases from A = 2048, while, by contrast, listener 
opinions are relatively insensitive to clipping until A < 512. 

The disparity between Figs. 9 and 11 suggests that even with B 
constant, s/n, the usual engineering measure of quantizer quality, 
is a poor guide to subjective ratings, mainly because the mean-square 
clipping is a poor predictor of listener ratings. A more useful measure 
of clipping distortion is clipping probability, which we have measured 
as the percentage of samples clipped in an utterance. P varies with A 
in the manner shown in Fig. 12. Observe that, like the ratings, P 
changes slowly as A decreases from 2048 and that it is most sensitive 
to changes in A when A < 512. These similarities account for the 
accuracy of the regression formulas in Table V, which have as in- 


TOTAL S/R 





2048 1024 512 256 128 64 
CLIPPING LEVEL 


Fig. 11—Total s/n as a function of clipping level at a constant number of bits. 
Circles indicate the maximum s/n at each bit rate. 
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Fig. 12—Percent clipped as a function of clipping level. 


dependent variables P and the signal-to-noise ratio Q of the granular 
quantizing noise NG. These formulas, based on the notion of two 
perceptually distinct distortions, are more useful predictors of subjec- 
tive quality than is total s/n, which is based on the fallacious assump- 
tion that listeners attend only to the difference between quantizer 
input and output with no regard to the components of this difference. 


8.3 Optimum quantizers 


Given a companding law and a fixed number of bits per sample, 
there is an optimum quantizer with overload point A* that provides 
the best mixture of clipping distortion and granular noise. For 
A < A*, clipping is the predominant type of distortion; for A > A* 
granular noise predominates. A circle in Fig. 9 indicates the subjec- 
tively optimum overload point for a given bit rate. As the number of 
bits per sample increases, so does A*. In high-resolution quantizers, 
it is possible to have low granular noise and very little clipping simul- 
taneously. Notice that the optimum points in Fig. 9 are all one or 
two octaves to the right of the corresponding points in Fig. 11. The 
experiment demonstrates that listeners are more tolerant of clipping 
than s/n measurements suggest. In addition, the curves in Fig. 9 are 
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considerably broader than those in Fig. 11, which indicates that 
listeners are relatively insensitive to changes in A in the region of A*. 

This observation relates directly to the quantizer dynamic range 
problem. While in the experiment we have held the speech power 
fixed and varied A, we would have obtained the same distortions by 
holding A fixed and changing the speech signal level. It follows that 
the horizontal axes that we have labeled “clipping level” can, for a 
single quantizer, be renamed “speech level,’’ which increases from 
left to right. Figure 11 shows that a uniform quantizer has near- 
optimum signal-to-noise performance for only a narrow range of 
speech levels. By contrast, we see in Fig. 9 that listeners give nearly 
optimum ratings over a much wider range of input powers. 


IX. CONCLUSIONS 


Our results lead to several general observations regarding the subjec- 
tive evaluation of speech degraded by digital coding. First, our data 
indicate that when the degraded speech includes certain types of 
digital signal distortions, such as peak clipping, then total s/n is a 
poor objective indicator of subjective speech quality. For the coders 
we studied, a simple linear combination of two objective measures 
was a good predictor of the subjective quality of speech with quantizing 
and clipping distortions; however, we do not know of any single 
objective measure which would be a good composite indicator of 
subjective speech quality for all types and combinations of digital 
signal distortions. Second, because some types of digital signal distor- 
tions seem to be perceptually distinct, it seems unlikely that the 
subjective quality of digital speech can be evaluated by reference to a 
single type of analog or digital signal distortion, such as speech- 
dependent noise. And third, because coders are optimized by trading 
off different types of distortions, it follows that the important cases to 
study are those where distortions occur in combination rather than 
singly. This implies that knowing the relationships between subjective 
speech quality and various types of reference-signal distortions 
occurring singly—be they digital or analog—may be of limited value 
for predicting the subjective quality of coded speech if most practical 
coders produce speech degraded by combinations of distortions. These 
observations should be kept in mind by designers who must struggle 
with the problem of how various parameters of their coders affect the 
subjective quality of the speech. 
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Rain-Rate Distributions and 
Extreme-Value Statistics 


By S. H. LIN 
(Manuscript received March 17, 1976) 


An extension of a method of Chen and Lahlum} for estimating the 
distributions of high rain rates 1s described. Through application of the 
statistical theory of extremes to available yearly maximum rain-rate data, 
a reasonably accurate distribution is obtained. The calculated results 
agree well with previously obtained 20-year data. Application of this 
method to Weather Bureau Rainfall Intensity-Duration-Frequency Curves 
has yielded 50-year distributions of 5-minute rain rates for 36 locations 
an the western United States; others can be similarly obtained. Such 
long-term rain-rate distributions are valuable for microwave radio-path 
engineering, especially in the western United States where shorter-term 
data sources are inadequate. 


I. INTRODUCTION 


References 2 and 3 describe a procedure for obtaining long-term 
(220 years) distributions of 5-minute rain rates from data published 
by the National Climatic Center.* Such distributions have been ob- 
tained for 202 locations in the eastern and midwestern United States 
and applied to path engineering of 11-GHz radio links. 

However, the excessive short duration rainfall data* on which these 
distributions are based contain only rainfalls that exceed an excessive 
rainfall threshold defined by the National Climatic Center.?? For 
example, the threshold is 75 mm/hr for 5-minute intervals. In low 
rain-rate areas, such as Oregon and Washington, almost all rainfalls 
do not exceed the threshold and, hence, are not included in the excessive 
short-duration rainfall data. For example, at Spokane, Washington, 
the 5-minute rain rate exceeded the 75 mm/hr threshold only once in 
the 20-year period from 1953 to 1972. This data source, therefore, is 
an unsatisfactory basis for radio-path engineering in such areas. On 
the other hand, processing other longer-term data—say 50 years—is 
tedious and costly. This has motivated the search for an alternative 
method. Fortunately, the statistical behavior of the extremes of a 
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random variable has been extensively investigated.*- This paper de- 
scribes a method for obtaining distributions of high rain rates by 
applying this theory to the yearly maximum rain-rate data published 
by the National Climatic Center. 

In an unpublished work, Chen and Lahlum! have applied the 
theoretical distribution of yearly maximum 5-minute rain rates and 
an empirical extrapolation to obtain the rain-rate distribution in the 
range of interest to radio-path engineering. In this paper, we extend 
Chen and Lahlum’s method by incorporating the theoretical distri- 
butions of yearly kth largest 5-minute rain rates for k ranging from 1 
to 12. The application of the higher-order statistics of extremes elimi- 
nates the need for empirical extrapolation. 

In this paper, a 5-minute rain rate corresponds to the average value 
of the randomly varying rain rate in a 5-minute interval and is calcu- 
lated as AH /r, where AH is the 5-minute accumulated depth of rainfall 
and + = 1/12 hour = 5 minutes. For illustration, only the statistics of 
5-minute rain rates are discussed in this report. The method is also 
applicable to other integration times. 
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Fig. 1—Comparison of 20-year rain-rate distribution calculated by extreme 
statistics method (dashed line) with 20- and 22-year data (solid lines) for Newark, 
New Jersey. The difference between the 20- and 22-year data also indicates the 
instability of high rain-rate statistics. 
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Fig. 2—Comparison of 20-year rain-rate distribution calculated by extreme 
statistics method (dashed line) with 20- and 22-year data (solid lines) for Wilmington, 
North Carolina. The difference between the 20- and 22-year data also indicates the 
instability of high rain-rate statistics. 


Il. THE STATISTICS OF EXTREMES 


Let R be the randomly varying 5-minute rain rate and R; be the 
kth largest 5-minute rain rate in a given year. In other words, Ri is 
the yearly maximum 5-minute rain rate, R2 is the yearly second largest 
5-minute rain rate, etc. The value of R; varies from year to year, and 
the probability distribution of R, is the subject of the statistics of the 
kth extreme. 

Many sets of rain-rate dataU— indicate that rain-rate distributions 
in the moderate and low rain-rate region can be closely approximated 
by the lognormal distribution. In the tail region, the time bases are 
usually insufficient to yield stable results for testing the lognormal 
hypothesis. Figures 1 and 2 show instability at extreme values occurring 
in 20-year time bases. We will assume that the rain-rate distributions 
are lognormal and proceed to show that the calculated distributions of 
extreme rain rates agree well with the data as displayed in Figs. 1 
through 9. 

Let 

x=Ilnk, (1) 
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Fig. 3—Comparison of 20-year rain-rate distribution calculated by extreme 
statistics method (dashed lines) with the 20-year data (solid line) at La Guardia 
Airport, New York, New York. 


and 
Ly, = In Rx. (2) 


The lognormal hypothesis for F is equivalent to the statement that x 
is (approximately) normal. The distribution of the kth extreme, x;,, 
as derived by Cramer® is 

e7Ny 


k-1 
P(x 2x) le ne) NI? (3)* 





where 
y = a(X — 0) (4) 


and is called the reduced variate. In this expression, a and U are scale 
and location parameters, respectively, and are related to the sample 
mean and sample standard deviation of x1. Notice that the distribution, 
P(a, 2 X), for any k, is completely determined by the two parameters 
a and U. These two parameters can be calculated from the measured 


*The cumulative distribution function (3) is obtained by integrating the prob- 
ability-density function derived by Cramer in Ref. 8. 
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yearly maximum rain-rate data. Let 
Ri(9), j am 1, 2, 3, uty M (5) 


be the measured yearly maximum 5-minute rain rate in M years of 
measurements. For example, #1(7) represents the yearly maximum 
5-minute rain rate observed in the seventh year of an M year experi- 
ment. Let 


ti(j) = In [Ri(j)] (6) 
be the yearly maximum value of z in the jth year. From the measured 
data of x1(j), 7 = 1, 2, 3, ---, M, we can obtain an approximate 


distribution of x;. The parameters a and U can be estimated by fitting 
the theoretical P(x: 2 X) to the measured data. Gumbel® has shown 


that a least-square fit of P(x; = X) to the data leads to the following 


formulas for calculating a and U: 


aaa (7) 
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Fig. 4—Comparison of 20-year rain-rate distribution calculated by extreme 
statistics method (dashed line) with 20-year data (solid line) for Pittsburgh, Pennsyl- 
vania, and Chicago, Illinois. 
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and 


z Zz 
U = v1 a , (8) 
where 
1 ' 
t= ap a) (9) 
is the sample mean of 2, 
—_ 4s 
A= & Coa, (10) 
j=l 


oz = Vee — #2 (11) 


is the sample standard deviation of x, 


Z(j) =-In(-Ingp44), (12) 
7 I 
Z = ad Z(j), (18) 
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Fig. 5—Comparison of 20-year rain-rate distribution calculated by extreme 


statistics method (dashed line) with 20-year data (solid line) for Harrisburg, Pennsy]- 
vania, and New Orleans, Louisiana. 
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Fig. 6—Comparison of 20-year rain-rate distribution calculated by extreme 
statistics method (dashed line) with 20-year data (solid line) for Lynchburg, Virginia. 


2 1s "2 
Z “Hx [Z(9) F, (14) 


o, = VR — J’, (15) 


Thus, we have all the necessary formulas for calculating P(x, 2 2). 
To obtain the rain-rate distribution, we substitute (2) into (3) to yield 


and 


e-Ny 


N!’ 





k-1 
P(R, Zr) = l—ee*® Do (16) 
N=0 


where 


y = a[(Inr) — U]. (17) 


Therefore, the time that R, will exceed the threshold r, on long-term 
average, is 


T(R, >r) = 7 X P(R, 2 1) 


rx lee 


k—1 e-Nyu 


where 7 = 5 minutes is the rain-gauge integration time. Furthermore, 
in any given year, R;, and R, will never occur in the same 5-minute 


(18) 


Il 
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Fig. 7—Comparison of 49-year (1903-1951) distribution of 5-minute rain rates 
calculated by extreme statistics method with measured 20-year (1953-1972) data at 
La Guardia Airport, New York, New York. 


interval if k ¥ 1. This means T(R;, = r) for various order k can be 


summed to yield an approximation to the original rain-rate distribution 
in the extremal region; i.e., 


T(R 21) & = T (Ry = 1) (19) 


for high rain rates. 
The only input required for the calculation is the yearly maximum 
5-minute rain rates, 


Ri(J), J = i, 2, 3, ey M, 


which can be obtained from National Climatic Center publications.* 


ill. COMPARISON OF CALCULATED AND MEASURED RESULTS 


Figures 1 through 6 display the comparison of the distributions of 
high rain rates calculated via statistics of extremes from the data for 
eight locations. The number of years M is 20 (from 1953 to 1972). In 
these figures, the solid lines represent the data obtained by the method 
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described in Refs. 2 and 3, whereas the dashed lines are the distribution 
calculated by the theory of extremes. 

In Fig. 3, the value of S in eq. (19) is varied from 1 to 12. For S = 1, 
T(R, = r) is the distribution of yearly maximum 5-minute rain rate 
and is approximately equal to 7'(R = r) only in the extremal region 
(beyond 160 mm/hr). As r decreases, 7'(R; 2 r) deviates significantly 
from T(R = r), limiting at the 5-minute-per-year level as r approaches 
zero. The basis for this saturation is that there is only one yearly 
maximum 5-minute rain rate (with 5-minute duration by definition) 
in any given year. It is obvious that the yearly maximum 5-minute 
rain rate can exceed any threshold by no more than 5 minutes per 
year. Similarly, for S = 2, T(R, = r) plus T(R, 2 r) is limited to a 
10-minute-per-year level as 7 approaches zero. However, Fig. 3 shows 
that the applicable range of approximation (19) increases rapidly 
with S. 

For engineering terrestrial radio paths, we are interested in the 
range of rain-rate distributions below 50 minutes per year, because a 
single radio hop outage exceeding 50 minutes per year is considered 
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Fig. 8—Comparison of 50-year (1897, 1903-1951) distribution of 5-minute rain 
rates calculated by extreme statistics method with measured 20-year (1953-1972) 
data for Pittsburgh, Pennsylvania. 
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Fig. 9—Comparison of 36-year (1903-1912, 1926-1951) distribution of 5-minute 
rain rates calculated by extreme statistics method with measured 20-year (1953-1972) 
data at Midway Airport, Chicago, Illinois. 


undesirable. Figure 3 shows that using S = 12 in eq. (19) includes 
the range of interest to terrestrial radio-path engineering. 

Figures 1 to 6 show similar close agreement between the calculated 
results and the data. 

The advantage of using the extreme statistics method is its sim- 
plicity; i.e., it is much easier to obtain the extreme-statistics results 
from the set of 20 maximum yearly rain rates than to obtain the other 
displayed results which require analyzing all heavy rainfalls in each 
year. 


IV. FIFTY-YEAR DISTRIBUTIONS OBTAINED FROM RAINFALL 
INTENSITY-DURATION-FREQUENCY CURVES 

Rainfall intensity-duration-frequency curves published by the 
Weather Bureau!® were obtained from approximately 50 years (1900-— 
1950) of rain-rate data processed in accordance with the statistical 
theory for distribution of yearly maximum rain rate by the Gumbel 
method.* In other words, these curves represent P(R; 2 r), where Ri 
is the yearly maximum rain rate. The return period Q(fi 2 r), which 
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is the reciprocal of P(R; = 1), is labeled on each curve; i.e., 


Q(hi 


IV 


pS PRED years. (20) 
These curves cover the range-of-return period from 2 to 100 years and 
rain-gauge integration time 7 (i.e., the duration) from 5 minutes to 
24 hours. We considered only the 5-minute rain-rate statistics in 
this report. 

Since these curves represent P(R, = r), the two parameters a and 
U in eq. (17) can be estimated by fitting the theoretical P(R; = r) 
to two points on the intensity-duration-frequency curves. Once a and 
U are obtained, we can calculate a distribution of high rain rates by 
eqs. (18) and (19). However, one adjustment is needed in this process. 
Gumbel‘ has shown that as the number of years 17 approaches infinity, 
Z and oz in eqs. (13) and (15) approach the following asymptotic values 


lim Z = y = Euler’s Constant 


M—o ~ 0.5772 (21) 
lim o, = a0 22 
Pr a (22) 


The corresponding asymptotic forms for a and U are 


T 





li =a, = ; 23 
oe ae (23) 
and 
im US US a, (24) 
Mo Axo 


The rainfall intensity-duration-frequency curves were obtained using 
the asymptotic values, a, and U.,, even though the number of years 
M are 50 or less. Such approximations introduce slight errors and can 
be corrected by the following relationships among a., U.., a, and U: 


a= seco (25) 


1 Zar 
C= 0z +21, = == |. (26) 


To relate theoretical P(R; = r) with the intensity-duration-frequency 


data, we combine eq. (16) and (20) to give 


QR. = 1) = >a, (27) 


where 


y = a.[(Inr) — U,, ]. (28) 
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The two parameters, a, and U,,, can be determined by equating (27) 
to two sets of data, say (ra, Q2) and (7s, Q:), read from the intensity- 
duration-frequency curves. It is easily shown that the relationships are 


Aa — Ab 








Oo tae nae (29) 
and 
_ Aglnr, — Asln ra 
CC igea Ags? 
where 
Ag=-—In|{In Qe (31) 
a Qa se 1 ) 
and 
A, =—In [in Qs |: (32) 
Q —1 


By substituting a,, and U,, into (25) and (26), we obtain a and U for 
the 50-year data. Substituting a and U into (18) and (19) gives the 
50-year distribution of 5-minute rain rates. 

The time bases (i.e., 1) in the intensity-duration-frequency curves 
are mostly 50 years or less. However, several locations have time bases 
much shorter than 50 years. For example, the time bases are 18, 16, 
and 17 years for Mt. Tamalpais, California; Tonopah, Nevada; and 
Yakima, Washington, respectively. Due to this limitation, we have 
chosen Q, = 2 years and Q, = 10 years for calculations of a and U. 
Since Q, and Q, are fixed, we then need only three numbers: M, 1a, 
and r, for each location, read from the intensity-duration-frequency 
curves to calculate the rain-rate distribution. 


For example, for New York City, the three numbers are: 
M = 49 years (1903-1951) 
Ya = 4.4 inches/hr = 111.8 mm/hr 
rp = 6.5 inches/hr = 165 mm/hr. 
Substituting 7, and r, into eqs. (29) and (80) yields 
A. = 4.828 
U.. = 4.64. 


Substituting 17 into eqs. (12) through (15) and a, and U,, into (25) 


and (26) gives 
a = 4,363 


U = 4.63. 


The 49-year (1903-1951) distribution of 5-minute rain rate calculated 
from this a, U pair for New York City is very close to the 20-year 
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(19538-1972) data, as shown in Fig. 7. Figure 8 shows similar close 
agreement between 50-year and 20-year distributions in Pittsburgh, 
Pennsylvania. 

However, Fig. 9 shows an appreciable difference between 36-year 
and 20-year distributions for Chicago, Illinois. This appreciable 
difference, the irregular shape of the 20-year distribution in Fig. 9 and 
the instability noted in Figs. 1 and 2 indicate that a 20-year time base 
with a single rain-gauge measurement may not be sufficient to guaran- 
tee a stable distribution for extremely high rain rates. If more stable 
results are required, the 20- and 50-year data may be combined to give 
a 70-year distribution. 

The calculated curves (dashed lines) in Figs. 1 through 6 are based 
on 20-year data from 1953 to 1972 in Ref. 4, whereas the calculated 
curves (dash-dot lines) in Figs. 7 through 9 are based on approximately 
50 years of data in Ref. 16. 


V. CONCLUSION 


A method has been described for calculating the distribution of high 
rain rates by applying the statistical theory of extremes to the available 
yearly maximum 5-minute rain-rate data. Figures 1 through 9 show 
that the calculated distributions agree closely with the data in the 
heavy-rain region of interest to radio-path engineering. The virtue of 
this method is that only yearly maximum rain-rate data are required 
to generate satisfactory results for radio-path engineering. The rainfall 
intensity-duration-frequency curves!® provide approximately 50 years 
(1900-1950) of such data for 203 locations in the United States. 
Furthermore, a new publication on ‘‘Maximum Short-Duration Pre- 
cipitation,’’!” for approximately 300 U. S. locations issued annually by 
the National Climatic Center since 1973, provides additional yearly 
maximum rain-rate data. Therefore, long-term (250 years) distri- 
butions of high rain rates for 203 U. 8. locations can easily be obtained 
by this method. 
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Distinguishing Stable Probability Measures 
Part |: Discrete Time 


By B. W. STUCK 
(Manuscript received April 1, 1976) 


A sequence of N, independent, identically distributed, random variables 
ts observed from one of two stable distributions with known parameters. 
The likelihood-ratio test for discriminating between these two distributions 
is found explicitly and performance limitations are determined. 

When the two distributions differ only in location, the likelihood-ratio 
test is sensitive to whether the distribution is nongaussian stable 
(0 < a < 2) when nonlinear soft limiting of large deviations is used, or 
gaussian stable (a = 2) when linear processing is used. 

When the two distributions differ only in scale, the likelihood-ratio 
test 1s sensitive to whether 0 <a < 2 when nonlinear soft limiting of 
large deviations is used, or gaussian (a = 2) when a chi-squared test 
1s used. 

The analysis of the two remaining cases, distinguishing between one of 
two characteristic indices, and between one of two skewness parameters, 
parallels the analysis of distinguishing between one of two scale parameters 
and is only touched upon briefly. 


I. INTRODUCTION 


The problem of classifying a series of observations as coming from 
one of two or more possible classes or hypotheses has received a great 
deal of attention in the statistical and engineering literature. In many 
physical situations, a variety of disturbances corrupt the observations; 
rather than model each disturbance separately, it is often argued on 
physical grounds that the disturbances add and are independent, and 
the central limit theorem is invoked to model this sum using a gaussian 
distribution. This approach is adequate as long as the sum is not 
dominated by one or a few of the summands; if one or a few of the 
summands does dominate the sum, the disturbances can possibly be 
modeled as a stable distribution, one member of a family of probability 
distributions which includes the gaussian, by invoking a frequently 
overlooked generalization of the central limit theorem. 
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The gaussian distribution has enjoyed great popularity in hypothesis 
testing because it is analytically tractable and because it is the only 
stable distribution with finite variance. Although it may be argued 
that mathematical models with infinite variance are physically in- 
appropriate, this view conveniently overlooks the fact that the gaussian 
distribution is unbounded, which is also a physically inappropriate 
mathematical model. The gaussian model may adequately model dis- 
turbances over a narrow range of amplitudes; an infinite-variance, 
stable-distribution model may adequately model disturbances over a 
larger range of amplitudes. Both distributions may be physically in- 
appropriate mathematical models, but the infinite-variance distribu- 
tion may, in this sense, be the better model. This paper examines 
several stable-distribution hypothesis-testing problems. 

The primary motivation for this work on stable probability measures 
is drawn from a recent statistical analysis! of noise on various telephone 
lines. This analysis indicated telephone noise may be adequately 
modeled (on the lines examined) by a sum of sinusoids at various 
frequencies plus a purely nondeterministic random process that is well 
characterized by a stable distribution (either gaussian or nongaussian 
stable). Since only a small number of lines were examined, this analysis 
is preliminary, awaiting other independent investigations.* 

Indirect motivation for this work is drawn from detecting electro- 
magnetic signals at frequencies of 100 kHz or less. Noise at these 
frequencies is claimed to be nongaussian; unfortunately, adequate sta- 
tistical evidence to substantiate this claim is lacking, with one 
exception.’ 

A final source of motivation is found in financial problems. Over the 
last decade, a large body of statistical evidence has been amassed which 
indicates that the differences of logarithms of successive equally spaced 
prices of common stocks can be adequately modeled using stable 
distributions. * 4 


Il. OUTLINE OF DISCUSSION * 


A sequence of N random variables is observed; for simplicity, it is 
assumed they are independent and identically distributed—drawn 
from one of two stable distributions with known parameters (charac- 
teristic index 0 < ai S 2, skewness parameter —1 S @’ S$ 1, scale 
parameter 7’ > 0, location parameter —» < 6i< o;j7 =0, 1)? It 


* Applications of this work to removing telephone noise will be presented elsewhere- 

t These results were first presented at the Highth Annual Princeton Conference on 
Information Sciences and Systems, March 28-29, 1974, p. 405, and at the 1975 
on span Conference on Information Sciences and Systems, April 2-4, 1975, 
pp, 49-51. 

? Both subscripts and superscripts will be used to denote the stable-distribution 
parameters under hypothesis H;(7 = 0, 1); these parameters will be discussed more 
fully in Section III. 
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is well known that the likelihood-ratio test is a decision rule that is 
optimum with respect to either a Neyman-Pearson or Bayes criterion.® 
Here, the likelihood ratio is found explicitly and performance limita- 
tions of the test are determined. The extension of these results from 
two to M stable distributions is well known and will not be dealt with 
here.® 

The (log) likelihood decision rule, because of the independence as- 
sumption, takes the following simple form: 


A’ 


N A 
Dlr) & L 
t=1 Ho 
In p(rs3 a, 6, en 51) 
Drs; a°, 6°, 7; 5°) ; 
where {r;}7’ are the N observed random variables, drawn from a dis- 


tribution with probability density p(z; a/, 8%, y’, 6%), and L’ is a thresh- 
old. Since /(r;) can be rewritten as the sum of four functions, 


I 


L(1s) 


ae 1 Ql Al 5!) (ri; 2° B} vy} 5!) 
l(r; = Jn Pits @ BLY P\i; ’ ’ ) 
( ) PTs; a, p, ae 5') 5 a D(Ts5 a, 6°, a 5) 
P(r; a; By; 51) D(rs} al’, ee ¥; 5+) 
+ In Sa? 7? In RY Sa eT RET ATC 
P(r; Ape Y's 5}) * PTs; a as ar 5°) ; 


each of which tests for only one different parameter, this suggests 
studying each of these four situations separately. 

Two special cases are examined in detail: when the distributions 
differ only in location and when they differ only in scale. The proba- 
bilities of error of the first and second kind are found for three analyti- 
cally tractable cases (gaussian, Cauchy, and Pearson V) by calculating 
the characteristic function of the log likelihood probability measure 
induced under each hypothesis; the general case is apparently analyti- 
cally intractable, and quite expensive to tackle numerically at present. 
Exponentially sharp upper and lower bounds on both types of prob- 
abilities of error, and also the total probability of error, can be simply 
derived from the Laplace transform of the log likelihood probability 
measure induced under each hypothesis. These bounds are found 
analytically in three cases, and relatively inexpensive numerical results 
are presented for selected other cases. 

When the two distributions differ only in location, the likelihood- 
ratio test is shown to be extremely sensitive to whether the distribution 
is nongaussian stable (0 < a < 2), when nonlinear soft limiting of 
large deviations is employed, or gaussian (a = 2), when linear process- 
ing is used. When the distribution is nongaussian stable, performance 
is found analytically to be quite sensitive to whether a linear (sub- 
optimum) or likelihood (optimum) decision rule is used: the total 
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probability of error for the linear test behaves asymptotically (N > 1) 
as O(AN!~2), while the total probability of error for the likelihood-ratio 
test is upper bounded by exp(—BN + C), where (A, B > 0, C) de- 
pend on parameters of the two distributions and are independent of N. 
(For related work that complements the results in our discussion, see 
the list of references and particularly Refs. 6, 7, and 8.) 

When the two distributions differ only in scale, the likelihood-ratio 
test is extremely sensitive to whether the distribution is nongaussian 
stable when nonlinear soft limiting of large deviations is used, or 
gaussian when a chi-squared test is used. Performance for nongaussian 
stable distributions is extremely sensitive to whether a suboptimum 
(chi-squared) or optimum (likelihood-ratio) test is used: the total 
probability of error for the chi-squared test behaves asymptotically 
(N > 1) as O(FN-“2-D), while the total probability of error for the 
likelihood-ratio test is upper bounded by exp(—GN + H), where 
(F, G > 0, H) depend on parameters of the two distributions and are 
independent of N. 

The analysis of the two remaining cases, distinguishing between one 
of two characteristic indices and between one of two skewness parame- 
ters, closely parallels the analysis that distinguishes between two scale 
factors and is only touched upon here. 

The continuous time analogs of these discrete-time problems are 
studied, where a sample function from one of two stable, stationary, 
independent-increment processes is observed for a finite time interval 
in the second part of this work. In contrast with this work, the analysis 
is simpler, and it is possible to obtain many results analytically in 
closed form. 

Section III deals with various mathematical preliminaries. A brief, 
selective, tutorial overview of the central limit theorem, infinitely 
divisible distributions, and independent-increment processes is pre- 
sented to place this work in perspective (as well as to fix notation). No 
attempt is made to be exhaustive in the discussion. 

The length of the discussion is due to the many special sets of 
parameter values that must be taken into account to be thorough. The 
main reason for this completeness is to adequately cover all cases where 
uncertainty is modeled using a distribution arising from a central- 
limit-theorem type of argument. The main contribution here is the 
results per se, many of which are presented here for the first time, which 
unfortunately often involve either tedious algebraic manipulation or 
machine calculations. It is hoped this will not obscure the surprising 
(at first glance) nature of the results: the quite singular behavior of 
both the log-likelihood-ratio test and (perhaps more importantly) its 
performance, for the gaussian vs nongaussian stable distribution, in 
distinguishing either location or scale. The generalization of these two 
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results to a wide class of infinitely divisible distributions (which 
include the family of stable distributions) is immediate, and is sketched 
at the end of Section IV. 


I. MATHEMATICAL PRELIMINARIES 


The reader is assumed to be familiar with the fundamentals of 
measure theory and probability theory, as found in standard 
references.°—” 

Underlying the discussion to follow are: 


(t) The notion of a probability space: a triple {Q2, A, P}, where 
Q is the set of elementary events, A is a o-algebra of Borel 
measurable subsets of 2, and P is a probability measure on A. 

(ii) The definition of a stochastic process x(t, w) defined on a 
parameter set HL (henceforth called time), with ¢ GC HL, w € Q, 
which is a function mapping the direct product E# X Q into the 
real line, and the associated probability measure induced by 
x(t, w). 

(277) The measure theoretic concept of absolute continuity of one 
measure with respect to another, and the measure theoretic 
Lebesgue decomposition theorem. 


3.1 Infinitely divisible distributions and independent-increment processes 


In this section, various properties of infinitely divisible distributions 
and independent-increment processes are briefly reviewed. The inter- 
ested reader is referred to the literature for much more information.?—® 

This tutorial section serves several purposes: 


(<) It gathers together for convenient reference all material on 
stable distributions to be used in Part II. 

(2z) It fixes notation. 

(722) It emphasizes the central role played by stable distributions 
in understanding both the central limit theorem and the Lévy 
decomposition of the infinitely divisible distributions. 

(tv) Finally, it alerts the reader to the rich structure and variety 
of infinitely divisible distributions, in general, and stable 
distributions, in particular, in the hope that they will find 
greater use in modeling uncertainty. 


The characteristic function of a (first-order) probability distribution 
P(«)* is defined as 


C.(v) = [ eaP@) = EK (et) a8. 
*Upper case P(-) will denote a probability distribution, while lower case p(-) 
will denote the associated probability density function; all probability distributions 


examined here in any detail are absolutely continuous with respect to Lebesgue 
measure. 
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It can be shown that two probability distributions are identical if and 
only if their characteristic functions are identical (Ref. 14, page 28); 
thus, there is a one-to-one correspondence between characteristic func- 
tions and probability distribution functions. A random variable is 
said to be infinitely divisible if, for every natural number n, the random 
variable can be represented as the sum of n independent identically 
distributed (i.i.d.) random variables, or equivalently if its charac- 
teristic function can be written as 


Cz (v) => [C.(2, n) |" n= 1, 2; vey 


where C, is the characteristic function of some probability distribution 
which may depend on n. Two well-known examples of infinitely 
divisible random variables are the gaussian [taking values on 
(— 0, )] and the Poisson (taking values at nonnegative integer 
multiples of h): 





Gaussian: C,(v) ie em a exp{ — (a — m)?/20?} dx 


—% TO 


exp (imu — 40°?) 


Dy Ne™ (cist) = exp[A(eith — 1) 


I 


Poisson: C,(v) 





De Finetti conjectured that any infinitely divisible distribution could 
be written as the convolution of a gaussian and a generalization of the 
Poisson; the resulting characteristic function can be written as 


In C,(v) = imv — }o%? + | (e™ — 1)dF(u), 


where the measure F'(u) specifies at what points the Poisson variable 
takes on nontrivial values. However, this conjecture was shown to 
hold only for a subset of the infinitely divisible distributions by Lévy ; 
if one desires a canonical form of the characteristic function of an 
infinitely divisible distribution, then the following remarkable theorem 
can be proved (Ref. 13, page 76). 


Theorem (Lévy): Any infinitely divisible characteristic function can be 
uniquely written in the canonical form 





: ay Saar wu 
In C,(v) = t6v — $o?v? + (er —l1- rc z) dv_(u) 


+r i (er -—l- 1 S ) dvs(u), 


where 6 is a location parameter (— © <6 < ~), o% > O 1s the variance 
of the gaussian component, and (v_, vz) are called the Lévy measure of the 
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generalized Poisson distribution. The conditions the Lévy measure must 
satisfy are (2) v_ and vz are nondecreasing on the intervals (— ©, 0) and 
(0, ©), respectively, (41) v-(— ©) = v,(©) = 0, and (att) for every 
jinite « > 0, 


0—- € 
/ wdv_(u) < © | wdvi(u) < ©. 
—e 0+ 


Some examples now follow: 
Example 1 (Poisson) :6 = whd/(1 + h?),o? = 0,» = 0,-~ <u <0 


ee bs: O0<u<h,. 
SN Oe hae? 
In C,(v) = dA(e*4 — 1). 


Example 2 (Cauchy): o? = 0, 6 = 0, 


oes ee —-x<u<0 
|u| 

jose 0O<u< oo; 

+ TU ? 


In C,(v) = t6v — clo. 
Example 8 (Gamma): o? = 0, vr. = 0,-» <u<0O0 


oo et a4u 
6 =p I, Tae < @ 
dv,(u) = pe-%d(In u); 
C.(v) = (1- sy" 
m=(1-8 


Most of the attention here will be focused on one particular class of 
infinitely divisible distributions, the stable distributions. 


Definition: A probability distribution is said to be stable if, for all 
a; > 0, ae > O, bi, be, there exist constants a > 0, b such that 


P(ayxz + 6;)*P (aor + be) = P(ax + b), 


where * denotes convolution. In other words, stable distributions are 
closed under the action of the group of linear affine transformations on 
the real line. 

An important reason for examining stable distributions is found in 
the central limit theorem (Ref. 13, page 162; Ref. 15, page 168): 


Theorem: P(x) ts a limiting distribution for a sum of suitably scaled and 
translated, independent, identically distributed, random variables if and 
only if P(x) is stable. 
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In many practical problems, a large number of independent dis- 
turbances add and introduce uncertainty in a measurement. To 
analyze the effects of uncertainty, it is often convenient to replace this 
sum by its limiting distribution, which must be a stable distribution. 
The reader is referred to the bibliography for references on exactly 
what conditions govern the limiting distribution being gaussian vs 
nongaussian stable (Ref. 12, pages 171-190; Ref. 15, pages 165-169). 

Stable distributions are infinitely divisible; the associated Lévy 
measures can be shown to be v_(u) = c_|ul—%, vy(u) = —cyum* 
(Ref. 18, pages 164-168; Ref. 14, pages 128-133). Requirement (2) 
that the measure be nondecreasing leads to a > 0, while the final 
requirement (277) forces a < 2. Substituting this into the canonical 
representation of the characteristic function of an infinitely divisible 
distribution and explicitly evaluating the integral over the Lévy 
measure results in the following theorem: 


Theorem (Ref. 18, page 164; Ref. 14, page 186): The characteristic func- 
tion of a stable distribution can be expressed as 


—vlols [1+ i tan (3) | + idv a ~1, 
In E(e**) = 3 
—y|>| [1 + i972 210 Jr | ey eee 


whereO<aZ2, -1S5 8S 1,y>0(y =c%), —~ <6 < ~. For 
0<a<1p=c—a/e_et+a;folsas2,pB=e—-c/a+e. 
Note that for a = 2, the characteristic function, as a complex-valued 
function of v, is C®, but for 1 < a < 2,itisonly C',andfor0 <a <1 
is only C®. 

For fixed 6 (6 ¥ 0), the characteristic function is discontinuous (as 
a function of a) in the neighborhood of a = 1. One approach to this 
problem is to rewrite the characteristic function (a # 1) as 


In B (eo?!) = yp)" E als i8 [yy tan (3)] 


Ta 


+ iv (6+ v6 tan ge 7B tan ™ ) 


—y|v]* + tyBv tan “> [1 — joe] 
+ iv (s + 76 tan 2). 
If a new parameter 6’ = 6 + 76 tan (za/2) is defined, then for 6 fixed 


i ules = a—l a 
lim tan —>- [1 |v] 2-2] ~ in |v}. 
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By inspection, this form of the characteristic function is not discon- 
tinuous in the neighborhood of a = 1. 

Since the characteristic function is in L;(—, «), all stable dis- 
tributions are absolutely continuous with respect to Lebesgue measure, 
and have analytic probability density functions. Four parameters com- 
pletely specify a stable distribution: 


(t) a, the characteristic index of the stable distribution P(X ; a, 8) is 
associated with the asymptotic behavior of P(X; a, 8). For 
—-1<8<1,0 <a < 2, 


lim |X|¢P(-X)=k_>0, lim X¢{1 — P(X)] = > 0. 
X>— X00 


For 6 = —1 (a similar argument holds for 6 = +1), Lipschutz!* and 
Ibragimov and Linnik (Ref. 17, pages 62 to 66)* have shown that for 
l<a< 2, 
P(X) = Ofh(a)|X|*20-" expl—c(a)|X|e"]} as X— — 

lim X¢{1 — P(X) ] = ky > 0, 

X > 


while for 0 < a < 1, 


P(X) = O{k(a)X2?4-% exp[—c(a)X~#/-2]} as XL O+ 
lim X*{1 — P(X)] = ki > 0, 


where k(a), c(a) are constants which depend only on a. For the asym- 
metric Cauchy probability density function, it can be shown (Ref. 17, 
pages 57 to 60) that 


p(X;a = 1, 
B= —1)=0 | exp G [Al 2 exp(e|X1/2)) | BG a 
lim p(X;ea=1,8 = -1X* =k, > 0. 
xo 
(it) 8 characterizes skewness of the distribution: if 8 = O the dis- 


tribution is symmetric about x = 6. Otherwise, 


Kol = Pye, 8) P(X pepe) 
: P(—X; a, 8) _1+8. 
n=O ea) Lae 
For 1 < @ < 2, the distribution is skewed to the left for —1 S$ 6 < 0, 
since P(6) < 1 — P(6), with the degree of skewness increasing as 8 


B 


* Note typographical errors in eqs. (2.4.30) and Theorem (2.4.7), of Ref. 17. 
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decreases. It suffices to consider varying 6 over one half its range be- 
cause from the characteristic function it follows that the probability 
density p(x) obeys the relation 


p(x; a, B, 7,6 = 0) = p(—-z;3 4, —6, 7,6 = 0). 


(zit) y(or c A y'/*) is a measure of the dispersion or spread of the 
distribution. 

(tv) 61s a location parameter, and for 1 < @ S 2, 6 is the mean. 

Only three analytic closed-form expressions for stable probability 
density functions are known at present: 


Gaussian (a = 2, -1 = 6B £1): 
1 zx-—s6\ 
(Ore Pra) | OSE 
Cauchy (a2 = 1,8 = 0): 
p(t) =<[@-R+APR  -e <a< a; 


Pearson V (a = 34,8 = —1): 
1 


ve ={°ge (2s) [-sety] #28 
0 x<d 





and its conjugate density 
p(t;a = 2,8 = 1, 7,6 = 0) = p(—a,;0 = 3,8 = —1, 7,5 = 0). 


Series expansions are known for the remaining stable density functions 
(Ref. 14, pages 138-148) : 


p(x; a,8,y = 1,6 = 0) 


eG) 


T k=l k! 
p(z;a,8,y = 1, 6 = 0) 


ntl sin 2 (9 ~ a) eee 
2a 





2 — k 
Tike k! 2 
p(x; Qa, pS Y = 1, 6 = 0) 
1 2. t=1)% 2 . _ : 
= — 7 ak I é*{sin (1 + B)é}e—@F/*)#1n fgg a= 1, 
wk=0 &k! 0 
where 


tan (67/2) = 6 tan (2a/2), and x> 0. 


*For asymptotic expansions for a = 1, see Ref. 17, Theorem 2.4.3 and Ref. 18. 
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The reader can check that the series for a = 2 reduces to the series for 
the gaussian, and the series for 0 < a < 1 and |8| = 1 are zero ona 
half line (cf. Pearson V). For (0<a<1, ~1<6< 1) and 
(lS aS 2, -—1 S$ 8 S 1), stable probability densities have support 
on (~«, «), The series expansion for the density for 0 < a < 1 can 
be used as an asymptotic expansion for the density for 1 < a < 2 for 
|8| + 1. It can be shown from the characteristic function directly that 
all stable distributions are unimodal (Ref. 18, pages 158 to 161; Ref. 
17, pages 66 to 76). 

Figure I is a plot of various stable probability density functions for 
fixed a (1 < a < 2) and several 8; for a near two, it is quite difficult to 
distinguish symmetric (8 = 0) and asymmetric stable distributions. 
Figure 2 shows that around the mode, all stable distributions appear 
roughly gaussian, for 1 < a < 2 (note the logarithmic scale). 

For @ in the neighborhood of two, the gaussian and nongaussian 
stable distributions are virtually identical around their mode, and it is 
only in the tails of these distributions that the differences are pro- 
nounced. One crude measure of the point at which the gaussian and 
nongaussian stable distributions diverge is the point at which the first 
term in the asymptotic series (a < 2) equals the gaussian density: 
for a = 1.90, 1.95, 1.99, this occurs at 3.342, 3.635, 4.158 gaussian 
standard deviations, respectively. 

One reason stable distributions have attracted little attention in the 
mathematical modeling of uncertainty is found in the theorem from 
Ref. 14, page 169: A stable distribution with characteristic index a has 
all absolute moments of order p,O<p<a <2: H(|2|?) < ©. Con- 
versely, (|x|?) does not exist, i.e., it diverges, for p = a,a < 2. 

This suggests (albeit heuristically) that stable distributions may find 
application in modeling uncertainty when, as the number of observa- 
tions increases, for 0 < a <1, both the sample mean and sample 
variance ‘‘wander erratically,” being dominated by one or a few ob- 
servations, while for 1 < a < 2, the sample mean stabilizes but the 
sample variance does not [cf. Refs. 1, 2, 3, 4]. 

The generalization of these ideas from discrete time sequences of 
independent, identically distributed, random variables drawn from an 
infinitely divisible distribution to continuous time sample functions 
of an independent increment process is clear. The characteristic func- 
tional of a stationary independent increment process can be uniquely 
written as 


In Ee O—209)] d 


= (f — s) | iv — $o%? + ¥ (ce —-l- i 3) dv_(u) 
ay A wu 
+ ‘es (« —l- 4) dv.(u) | 
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CHARACTERISTIC INDEX a= 1.1 
SKEWNESS PARAMETER 6 


———— B=-0.75 


—-——- 6 = —0.25 
——— f= 0.0 


p(X) 





CHARACTERISTIC INDEX a= 1.3 
SKEWNESS PARAMETER 6 


———= 6=-0.75 
» p= —0.50 
0.4 


0.0 
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x 


Fig. 1—Stable probability density functions te = 1.1(0.2)1.7, 8 = —0.75(0.25)0.0]; 
scale factor c = 1.0; location parameter 6 = 0.0 
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for0 S$ s <t < T. The parameters 4, o?, and (v_, vi) have been defined 


already. In words, any independent increment process can be decom- 
posed into 
(z) A singular piece, called the drift, specified by 6. 


0.4 
CHARACTERISTIC INDEX a= 1.5 


SKEWNESS PARAMETER £6 


—-——=— p=-0.75 
= -0.50 


0.3 





xX 02 
Qa 
0.1 
CHARACTERISTIC INDEX a = 1.7 
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Sp 2-075 
x 
Qa 





Fig. 1—(continued) 
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CHARACTERISTIC INDEX a 
SKEWNESS PARAMETER £6 = 0.0 


meme Gg = 1.1 
momen 0=15 


mm = 1.9 


LN p(X) 





CHARACTERISTIC INDEX a 
SKEWNESS PARAMETER £6 = —0.25 


mmm = 15 


Sa 1,9 


LN p(X) 





—30 -20 -10 0 10 20 30 
X 


Fig. 2—Stable probability density functions (semilogarithmic) [a = 1.1(0.4)1.9, 
B =—0.5(0.5)0.5]; scale factor ¢ = 1.0; location parameter 6 = 0.0. 
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(21) A gaussian component, a component with continuous sample 
paths that have unbounded variation with probability one 
(w.p.1), specified by o?. 

(iz) A generalization of the Poisson process called a jump process, 

2 


CHARACTERISTIC INDEX a 
SKEWNESS PARAMETER £ = —0.50 


—2 


—4 


LN p{X)} 
| 
oO 
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a=19 


LN p(X) 
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Xx 


Fig. 2—(continued) 
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with sample paths that are constant except for simple jump 
discontinuities at random times with random amplitudes, 
specified by the Lévy measure (v_, v4). 


A (separable) pure jump process, a stationary independent increment 
process with no gaussian component, has sample functions that are of 
bounded variation* with probability one if and only if 


[ular + fi udv,(u) <0, 


An example of an independent increment process with bounded varia- 
tion (w.p.1) is a stable independent increment process (0 < a < 1) 
while stable independent increment processes (1 S a S 2) have un- 
bounded variation (w.p.1). The intuitive meaning of the Lévy measure 
is that first proposed by De Finetti: the Lévy measure specifies the 
density of the amplitudes of the jumps of the Poisson process, provided 
the process sample paths are of bounded variation (w.p.1). 

By allowing 6, o?, and (v_, v,) to depend upon time, a time-varying 
generalization of infinitely divisible distributions or nonstationary in- 
dependent increment processes is obtained. By examining nonanticipa- 
tive functionals of either a discrete time sequence of i.i.d. random 
variables drawn from an infinitely divisible distribution, or a con- 
tinuous time independent increment process, a wide variety of Markov 
processes are derived. Thus, the generalizations of the results presented 
here to many other situations may sometimes be immediate. The 
richness of this class of random processes suggests these results may 
find wide application. 

Historically, the mathematical study of independent increment 
processes concentrated first on the gaussian case, then on the stable 
case, and finally on the general case. To date, most of the engineering 
literature has concentrated on the gaussian case or the purely Poisson 
case, with the notable exception of Frost.!® It is hoped this work will 
suggest promising avenues of constructive research by studying the 
stable case, as well as shedding light on some of the quirks of the 
gaussian case. 


IV. DISCRETE TIME DETECTION OF TRANSLATES OF STABLE MEASURES 


One of two sequences of independent, identically distributed (1.i.d.), 
stable, random variables is observed, under one of two hypotheses 
(Ho, Hi): 1 x 

1 Th = 8 Nk <hRe< 
Hy re = O+ nr, Late): 


* The variation of afunction f(t),0 <t <T, is defined assup 1./,! | fiz) — FCs) | 
where the supremum is over all possible partitions of the interval [0, 7]:0 = to <t 
<eee<ty = T. 
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The observed or received sequence is denoted {r;}?, while {nz}? is a 
sequence of i.i.d. stable random variables with known parameters 
(a, B, y, 6 = 0); both s! and s° are known. The a priori probability of 
H; is denoted 7; (j = 0, 1). (The extension of allowing s}, s° to depend 
on & is immediate and is not dealt with here.) 

The measures induced by {r,} under Hy and H, are clearly not 
mutually orthogonal. Two cases occur: for (0 << a<1, -—1 <8 <1) 
and (l Sa S 2, —1 S$ B S 1), the stable measures have support on 
the whole real line, and hence are equivalent. For (0 <<a<1,8 =1 
or —1), the stable measures have support on a half line, and hence one 
measure is absolutely continuous with respect to the other but not 
vice versa: the supports of the two measures overlap except for the 
interval [s°, s'). In either case, since the measures are not mutually 
orthogonal, the decision rule, which as is well known minimizes both a 
Bayes criterion as well as a Neyman-Pearson criterion, is the likeli- 
hood-ratio test.> The goal is to find the exact form of this test, and 
characterize its performance.* Performance here means calculating the 
probability that H, is chosen given that Ho is true, and the probability 
that Hy is chosen given that H, is true; these are called probabilities 
of error of the first and second kind, and are denoted Pio and Po, 
respectively. A quantity which is also of interest is the total probability 
of error, defined as (roPi0 + m1P 01) = Pu. 


4.1 The likelihood-ratio test 


The structure of the optimum detector is handled in two separate 
cases. First, when (Q0<a<1, -1<6<1) or Sa 2, 
—1 < 8 S 1), the likelihood ratio is always strictly positive and finite, 
and is 


_ ’ = Prlti — 81) § es 
A= A(ri, -, ty) = yy = Dall; = 9) FF, 2 L, 


where p,(-) is the probability density of n,. An equivalent test is to 
compute the log likelihood ratio, 
N mM 
=InA=DX]l(r) 2lnL=L', 
i=l Ho 
where 


= Palri — 8!) 
L(r;) = In ” onlt; — 8)? 


and this can be explicitly calculated using the series expansions de- 
scribed earlier. Before doing so, it is worthwhile to examine two 


* A discussion of the power of this test (or any other test) is deliberately omitted. 
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analytically tractable cases: 


Gaussian (a = 2, -1 S68 S11): 


ett we ae < w; 





Pr(x) = 


4ac 


U(r) = — Fa L(re- 8) - Oe - 97] 


a LBralst — 8) — (+ (9); 


> 


aia e Ln- ( i) aoe (s] 2 In Par 


The log likelihood test can be implemented using only linear process- 
ing. The rule has the interpretation of comparing an energy-like 
quantity, the received signal suitably translated and squared, with a 
threshold. Equivalently, the test defines a hyperplane in R¥, and de- 
pending upon which side of the hyperplane (ri, ---, rv) lies, Hi or Ho 
is chosen. All of this is well known (see Ref. 5, pages 94-97 and 163- 
173). 


Cauchy (a = 1, 8 = 0): 


pa(t) ={@+ey% 2 <e<o; 


ea RES 8? See 
L(r;) = In (7, — 8)? +4 c2? 
r a lala a ’ 
a’ = Dn Een nL = L’. 
Unlike the gaussian case, the Cauchy log likelihood detector operates 
nonlinearly on the observation. A straightforward Taylor series ex- 
pansion of the log likelihood about r; = $(s! + s°) shows that for small 
perturbations about this point the log likelihood is linear in the perturb- 
ing quantity. On the other hand, for large excursions in any one 
observation, 


r; 


— gl , 
i Ss > p ue 


— 
A as ae 














this one term in the sum behaves as O(r;‘) or, in other words, very 
large excursions in the received signal are essentially (but not entirely) 
discarded; this type of behavior will be called soft limiting. Only for 
N = 1 does this test reduce to finding a hyperplane and determining on 
which side of the hyperplane the observation lies in order to choose 
H 1 Or H 0- 
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The cases (0 <a <1,-1<8<l1)and(l1SaS2,-1<8 <1), 
can now be examined; it is a straightforward exercise to substitute into 


the log likelihood the series expansions for stable probability density 
6 


CHARACTERISTIC INDEX a= 1.1 
SKEWNESS PARAMETER £8 


=-0.5 
4 —--- B=00 
—-—-— B= +05 


LOG LIKELIHOOD 
S) 


-2 


A 





-6 


CHARACTERISTIC INDEX a= 1.5 
SKEWNESS PARAMETER 8 
B=—-0.5 
——-— 8=0.0 
— sn oe B=+0.5 


LOG LIKELIHOOD 





—30 ~20 —10 0 10 20 30 


Fig. 3—Representative log likelihood functions (s!' =+10, s®? = 0) (a fixed, B 
varying) ; scale factor c = 1.0; location parameter 6=0. 
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CHARACTERISTIC INDEX a = 1.9 
SKEWNESS PARAMETER 6 


g=-0.5 
——— B=0.0 
—-—- B=405 


LOG LIKELIHOOD 





Fig. 3— (continued) 


functions. Figures 3 and 4 show various representative log likelihood 
ratios [l(r;)] for (1 <a <2, —1<8< 1) with a fixed a and B 
varying; Fig. 5 shows the same log likelihood ratios as in Fig. 4 with 
@ fixed and a varying. Similar results hold in the remaining cases 
(0<a<1,-1<6 <1). 

Three points are emphasized here. First, the structure of the optimum 
(log likelihood) detector is very sensitive to whether the underlying 
distribution is gaussian or nongaussian stable; this is not surprising, 
because small perturbations away from a = 2 result in a singular 
perturbation in the probability density function.* Second, when the 
observation is in a neighborhood of 4(s° + s!), an identical Taylor series 
argument, as used in the Cauchy example, is applicable, and small 
perturbations about this midway point result in linear perturbations 
about the corresponding log likelihood point. Third, when large 
excursions occur, 


r; — 8° rT; — sl 


> 1, Si 














the (log) likelihood for this term behaves as O(7r;-'), which follows from 
asymptotic expansions. 


* However, stable distributions in the neighborhood of a = 2 are all close with 
respect to the topology induced by any reasonable metric, e.g., Prokhorov’s metric. 
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The first two points in the preceding discussion hold for (1 S$ a < 2, 
|8| = 1). The third point must be slightly modified (assume now 
8 = —1, since a similar argument follows immediately for 6 = 1): 
l(r;) ~ O(r7') for r; > 0, but for r; < 0, I(r) ~ O(— |r; |") (ef. 
gaussian case) (1 < a < 2), while for r; < 0, a = 1, 

L(r,) ~ O[—exp(x |r; | /2)]. 

It remains to consider {n,}%, a sequence of i.i.d. stable random 
variables with (0 < a < 1, |8| = 1). Assume from here on 6 = —1, 
s! > s°, The likelihood ratio is thus zero or strictly positive and finite, 


and the log likelihood is either minus infinity or finite. First, consider 
the Pearson V distribution as an example: 


Pearson V (a = 3, 8 = —1): 











mis 
c—I[- exp| —c/2x x20 
rt) = | ce) epl~</2e] 
0 x <0; 
3 rv; — si Cc 1 1 
panies be we | Ge Fe 4 .> el 0 
l(r;) = in (2=5) 5| ++ 5 | ee ees 
— 0 si>r;2 8° 
v 3 r;— si c 1 1 Hi 
pe ee v oon _ > , 
. al pin (=) s| a —3| 22 
7,2 s> 9? 
for all 4, 1sitN, 
A’ = —o (choose H,) if s' > r; = 8° 
for some 2, 1sisSun. 


If all the received signal samples are greater than s!, the optimum 
test is to compute the log likelihood and compare it with a threshold to 
choose H, or Ho. Note that for (r; — s!)/c > 1, l(r;) decays asymptot- 
ically as O(r;'), and thus large deviations are weighted lightly. For 
r; > s', (r; — 8s!) Ke, l(r:) ~ (r; — s')“!. If one or more observations 
fall in the interval [s°, s'), the optimum rule is to choose H». 

The remaining cases (0 < a < 1 and 8 = —1) can be treated in an 
identical manner, using the series expansion for the densities. The im- 
portant points are (z) the optimum detector is fundamentally non- 
linear; for (r; — s')/c > 1, l(r,;) decays as O(7;'), (22) if any observa- 
tion falls in the interval [s°, s!), the optimum strategy is to choose Ho, 
(iit) for r; > s!, [reg — 8!| Ke, l(rs) ~ OL (ri — st) O- © 1. 


4.2 Performance limitations 


To complete the solution of the problem, the probabilities of error 
of the first and second kind must be calculated. This appears to be 
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quite difficult in the general case of an arbitrary stable distribution 
and bounds are developed in Section 4.3. In this section the per- 
formance of the optimum (log likelihood) detector is found explicitly 
for the three analytically tractable stable distributions to illustrate the 


8 


CHARACTERISTIC INDEX a= 1.1 
SKEWNESS PARAMETER 8 


6 B= 0.5 
—-—-—- B= 00 
rm B= 405 


LOG LIKELIHOOD 
° 


-4 


—6 





CHARACTERISTIC INDEX a= 1.5 
SKEWNESS PARAMETER 8 


B= -0.5 
---—-— = 00 


LOG LIKELIHOOD 





—30 -20 —10 0 10 20 30 


r 
Fig. 4—Representative log likelihood functions (s! =+10, s° =—10) (a fixed, 6 
varying) ; scale factor c = 1.0; location parameter 6 = 0.0. 
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CHARACTERISTIC INDEX a= 1.9 
SKEWNESS PARAMETER g 


g=-0.5 
—--—-— p= 00 


LOG LIKELIHOOD 





Fig. 4— (continued) 


problems that must be addressed in the general case. The approach 
adopted is to calculate the characteristic function of the log likelihood 
probability measure induced under either H, or Ho. 


Gaussian (a = 2, -1 S$ 8B S 1) 


Section 4.1 showed that the log likelihood ratio is 
, 1 a 1\2 0) 2 
A= — 7 2 Eers — 8) (ee Fl ey 


and since the log likelihood is a sum of 1.i.d. random variables, its 
characteristic function can be found by using elementary Fourier tech- 
niques. The results are: 


ror N(s! — 8°)? -. 
In E(e A | 1) = == gap =e = v? | 
— o\2 
In H'(e*4’| Ho) = eG? [—i — v?]. 


Using the Fourier inversion lemma, the density of the log likelihood 
under either hypothesis can be found in closed form to be 


1 ' 
A’|H,;) = ——e —(A’ — 8,)?/4c” -o <N’<ow 
p( | ) Vag oe ( 1?/4c"? | 
j = 0,1, 
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CHARACTERISTIC INDEX a 
SKEWNESS PARAMETER 8 = 0.0 


a=19 


mom a= 15 


LOG LIKELIHOOD 


Hy: r=sltn 
Ho: r=s? +n 
s'=10 

s0 = —10 


CHARACTERISTIC INDEX a 
SKEWNESS PARAMETER 8 = 0.5 


a=1.9 


wee gH 15 


LOG LIKELIHOOD 


Hy: r=st+n 


Ho: r=s? +n 
s1=10 
s0 = —10 





—30 —20 —10 0 10 20 30 
t 


oo 5—Representative log likelihood functions (s' = +10, s° = —10) (@ varying, 
B fixed). 
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where 
[st — 9°|VN 


8; —59 = De 


N ; 
= 4¢2 (st = 8), Os 


The probabilities of an error of the first and second kind are 


Pi) = Pr [choose H,| Hp true] = [° pa! oan’ 
L¢ 
_il L' — 5\ _ tly L' — 6 
= 5erte (= Ne 5{t- - ( 


8) 
=E-eTe eT 


Po. = Pr [choose Ho|H; true] = 1 — 5 erfc (= 


~ f+ (See) ool (HH) 
fob(2x4)]] 


where erfe (-) is the complementary error function (Ref. 20, eq. 7.1.2) 
and ,F, is a hypergeometric function (Ref. 20, eq. 7.1.21; see also 
Slater, Ref. 21). 


Cauchy (a = 1, 8 = 0)* 
It was noted previously that the log likelihood ratio can be written as 


(rs — SP +e 
(ri — IP + 2 


The characteristic function for the log likelihood can be found just as 
for the gaussian case: 


E(e*4"| A) 
Ps 20 (7; — 2 +e edr;/ 
= [> [oo 2 Gaoree) Masdrre 


ea ag 





= 3° 





bol oo 








=> 


ue “Cee + ak + ATG — a + ep (2) arf", 


where 
A = 3(s! — 8), t= 17; — 3(s' + 8°). 


* The following analysis was suggested to the author by S. O. Rice; any errors in 
the development here are the responsibility of the author alone. 
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It now helps to realize 
; 2Ax 
2 2— (72 2 2 alae 
(c+ A)? +c (a? + A +A (lta eye) 


so that the characteristic function can be written as 


ad Anz iv DAxr —itv—-1 
nowt) =| ["tare¢e) -apare) 
(< apie) 
7 Re ee oe EU Su 2Ax a 
=| fe n ) (Era) 


c F dix iN 
(£2) (-» Te | 
Only even powers of (m + n) contribute to the integral. This observa- 


tion can be combined with the definition of the beta function (Ref. 20, 
eq. 6.2.1.) to show that 


Be |i) = [Varta 5, 5,(2)(“* *) 
(4 yom een (EE) 








A? + ¢ T Tim + n+ 1) 
Substituting (m + n) = 2I, and using the identity (Ref. 20, eq. 6.1.18) 


Td+3)_ 2a 
T(2l) = 4'T(1)’ 


results in the final form of the log likelihood characteristic function 
assuming 7, is true, 


nowy « [- [rea (225) 


= N 
A aye F1(—21, wv +1; — +1; -1)| 











The term 
T'(—w + 21) 


(“Oar = T'(—1) 


is standard notation for Pochhammer’s symbol (Ref. 20, eq. 6.1.22). 
A similar expression results for the characteristic function of the log 
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likelihood, assuming the other hypothesis is true: 


on 2 aa | } 
Hees y= ar eee 


(225) wr—2, iv; ~tv — 21; -1 |" 


Since these series converge for all v (— 2 <v < ©), as well as for all 
(finite) values of A and c, the Fourier inversion lemma guarantees that 
a unique inverse to these transforms exists, and thus in principle the 
density of the log likelihood under either hypothesis is known and the 
probabilities of error of the first and second kind can be calculated. 
Numerical results are presented in a later section that were arrived at 
in exactly this manner. 

Several additional observations can be made. For N = 1 the log 
likelihood is a random variable whose distribution has compact support 
on the interval 


NAN ee Ae sa’-(*$*) sinters 
A Mice a 4 +ce+A 2 A+ oe — 


and thus the support of the log likelihood distribution for any finite 
number of samples, say N, is on the closed interval 





In 


Nin VAR SA <A! -w(2$*) swmtotet. 


“qMepe ek VA+e—A 


Since the log likelihood distribution has compact support, it is well 
known (Ref. 22, p. 121) that its Fourier transform has support on the 
entire real axis. The second observation concerns the asymptotic 
(v > 1) behavior of the characteristic function of the log likelihood. 
Since the saddle points of the log likelihood characteristic function 
are at +vA? + c?, stationary phase arguments® show that asymptot- 
ically (v > 1): 


= 7 é (x + AP +e c dx 
= Le exp ( ion (e — A)? + a (;) (@ — A+ a | 
CF : : VA? +e+A 7 

~| a (A? + c?)# {exp (win eee 4 eae - +5) 


Aa ey. i t\ |* 
+ ex (im ea) to). 


so that asymptotically the characteristic function decays as |v|—%/2. A 


E(es’ 
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similar result holds for the log likelihood characteristic function as- 
suming Hy is true. 

An alternate approach is to calculate the Mellin transform of the 
likelihood probability density (for N = 1), then raise it to the Nth 
power and find the inverse transform; this was investigated without 
success. A direct approach, convolving the probability density of the 
log likelihood with itself N times, was also attempted; the resulting 
integrals were intractable. 


Pearson V (a = 3, 8 = —1) 
Assuming H; is true, the characteristic function of the log likelihood is 
E(e*4’| H,) 


ir dE ea 



































where A = 3(s! — 8°), x = r; — $(s! + 8°). All attempts to simplify 
this expression were unsuccessful. Stationary phase arguments show 
that asymptotically (v > 1) 


B (ei | Hy) ~ [Vg t ex (ots + if) +0(5)", 


where (ki, ke, k3) are complicated functions of (c, A). 

An attempt was made to find E(e*”4’|H), assuming no observation 
occurred in the interval (s°, s'); this approach encountered the same 
problems as finding H(e**4’| H,), and was unsuccessful. 

It is worth noting that the log likelihood has only one maximum on 
the interval (s!, ©), for H(e4’|H;)(j = 0, 1), and hence only one 
stationary point enters into the stationary phase asymptotic expression 
for E(e4’|H,;). It can be shown this behavior is typical of any asym- 
metric (|8| = 1) stable distribution. In contrast, the log likelihood has 
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two maxima for any stable distribution (—1 < 8 < 1), and hence two 
stationary points (cf. Cauchy). 

Neither the use of Mellin transforms (instead of Fourier transforms) 
nor convolving the log likelihood density with itself N times made the 
problem any more tractable. 

In the case of an arbitrary stable distribution, it appears quite 
difficult to find the density of the log likelihood by calculating the 
characteristic function of the log likelihood probability measure in- 
duced under either H; or Ho, because only series expansions are known 
at present for stable probability density functions (except for the three 
cases covered here). Even resorting to numerical approximation tech- 
niques poses some quite difficult problems: for0 <a <2,-15 881 
(as for the Cauchy and Pearson V distributions) the log likelihood 
characteristic function has its support on the entire axis, and oscillates 
and decays asymptotically as O[(e”¥°/vv)* ] from stationary phase 
arguments.* To accurately approximate numerically the probabilities 
of error of the first and second kind from the log likelihood characteristic 
function, the characteristic function must be approximated and stored 
at a great many frequencies, and the total cost (especially due to 
storage) can be quite high. Furthermore, one would like to carry out 
calculations for many different values of (a, 8, y, 5). The storage cost 
plus the large number of parameter variations often desired can make 
this program quite expensive at present. 


4.3 Analytic performance bounds 


Because of analytical and numerical problems encountered in ex- 
plicitly calculating the probabilities of errors of the first and second 
kind, as well as the total probability of error, bounds on these quantities 
were investigated. 

Let P; and Po be probability measures defined on the same measure 
space (Q, A). For 0 < q < 1, define 


_ (aPx\* ( aPo\™* 
ha(Ps, Po) = (S2)" (S22) 
where yu is any measure defined on (Q, A) such that uw > Pi, un > Po. 


(An example of such a uw is uw = Po + P,.) This definition of h, is seen 
by inspection to be independent of yu. Define 


H,(P:, Po) = : dhq(P1, Po) 


* Different contours of integration (e.g., path of steepest descent) were investigated 
without success. 
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as the Kakutani inner product of Py with P; (Ref. 24); the classical 
Hellinger integral is a special case of the Kakutani inner product, and 
is defined as Hi(P,, Po). It is known that 


05 4,(Pi, Po) = 1, 


with H, = 1 iff P; = Py a.e. The Kakutani inner product can be 
thought of intuitively as the amount of “‘colinearity”’ or ‘‘overlap”’ of 
two probability measures, with the larger the Kakutani inner product, 
the larger the “‘overlap.’’ A number of useful properties of the Kakutani 
inner product are summarized in the following easily proven lemma?*}; 


Lemma: (1) Po and Py, are mutually orthogonal (denoted Po L Pi), 
© H,(Po, Pi) =O Sh,(Po, Pi) = 0 
(2) If0 <q <1, Hq(Po, Pi) is continuous in q. Four cases 
determine the behavior of H,(Po, P1) at q = 0, 1: 


(2a) If Po and P, are equivalent, then H,(Po, P1) is continuous 
atq = O0andq = 1. 

(2b) If Po is absolutely continuous with respect to P, but not vice 
versa, then H,4(Po, P1) ts continuous at q = 1 but not at 
q=0. 

(2c) If P, is absolutely continuous with respect to Po but not vice 
versa, then H4(Po, P1) is continuous at q = 0 but not at 
q=1. 

(2d) If Po and Py are neither mutually orthogonal nor equivalent, 
then H4(Po, P1) is discontinuous at gq = 0, q = 1. 

(3) H,(Po, P1) and its logarithm are convex functions, 0 < q 
<1. The convexity is strict iff (dP1/dPo) (x) is not constant 
for all x € supp(Po) () supp(P1). 


It is instructive to rewrite H,(Po, P:) in two different ways to ex- 
plicitly show the relationship between the log likelihood functional and 
the Kakutani inner product: 


(i) H,(Po, P:) = / exp{qIn (dP1/dP»)}dPo 
= E{exp [qIn (dP1/dP») ]| Ho}, 
(it) Ho(Po, P) = / exp{(q — 1) In (dP1/dP»)}dP; 
= Efexp [(q — 1) In (dP1/dPo)]| Hi}. 


(t) and (72) are the Laplace transforms of the log likelihood probability 
density (also called the moment generating function of A), evaluated 
at q and (q — 1), and assuming H, and Hj are true, respectively. It is 
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196-197). Using Hélder’s inequality, it is straightforward to show that 
the logarithm of H, and, hence, H, itself, are convex functions of gq, 
0<q<l. 

Chernoff*® was apparently first to use H,(Po, Pi) (where Po <u, 
P, <u, » = Lebesgue measure) to upper bound the probabilities of 
error of the first and second kind, and his work has found widespread 
application in the engineering and statistical literature (see also, Ref. 
14, pages 517-520 and the references therein). 

In the notation used here, Chernoff showed 


Po. < inf H,(Po, Pen’ 
0<q 


Pio S inf H,(Po, Pre”, 


q<l 


A 


where L’ is the threshold in the log likelihood ratio test. 

Chernoff’s original ideas have been generalized in several directions. 
Kraft?’ obtained upper and lower bounds on the total probability of 
error. For some choice of L’ (see also Ref. 28): 


4 min (19, m1)Hi(Po, P) <s Pr < (wom1)?H3(Po, Py); 


Hellman and Raviv” have also worked on this problem. Shannon, 
Gallager, and Berlekamp®* obtained lower bounds on the probabilities 
of error of the first and second kind in terms of the logarithm of 
H,(Po, P1), and the first and second derivatives of the logarithm. 

Here the Kakutani inner product plays two key roles, providing a 
check on whether or not singular or perfect detection is possible 
[iff H,(Po, Pi) = 0], as well as giving exponentially sharp bounds on 
the performance of the log likelihood ratio test if detection 1s not singu- 
lar. Since the Kakutani inner product need only be calculated at a 
small number of values of q to accurately numerically approximate 
upper and lower bounds on error probabilities, unlike calculating the 
probabilities of error of the first and second kind from the log likelihood 
characteristic function, this approach may be useful as a practical 
design tool because it is relatively inexpensive. 

The following observations are strightforward exercises: 


(t) When a sequence of N i.i.d. random variables is observed, 
H,(Po, P1) = e-4%, where A is independent of NV, depending 
solely on Po, Pi, and gq. 

(it) When Py and P, are absolutely continuous with respect to 
Lebesgue measure, and the corresponding densities are unimodal 
translates of one another, then for fixed q, the larger the separa- 
tion the smaller the inner product H,(Po, P1). 
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The Kakutani inner product H,(Po, P1) can be explicitly calculated 
for the three analytic cases discussed earlier: 


Gaussian (a = 2, -1 $B S11): 
Dn(&) = 1/V4ac exp(—22/4c?) —-x <4r< ow 
H (Po, Pi) = eX%#™@, u(q) = In [ pi(a — s')pi-*(a — 8)dx 
u(g) = —@(1 — g)(s' — 8°)?/4e?; 
1 gl a g° 2 
° (3) aa ( 4c ) 


Cauchy (a = 1, 8 = 0): 





Pilz) = +c) —-x<¢4< ow 


Cc 
a 
H,(P Pi) =| [~ phe — spe - de] 





ne Se es | _yv_ C (A/2)? | 
1 g)a: : ‘ 
Pe Fula 2559 27; >) 


where A = (s! — s°)/2. 
From tables (Ref. 30, 263.00) for elliptic integrals: 


Hi(Po, P1) oe 4 ss Eins 
= (CY HP oe [a [EY ey Y 


where cn“!(-, -) is an inverse Jacobian elliptic function. 








Pearson V (a = 3,8 = —1): 


1 (gy ah 
eee, ats é CI LL xv > O 
Dalz) = cv2nr \ Cc =z, 


0 z <0 
oo N 
HPs, Ps) =| [” phe — spt — ade 


The integral could not be expressed in any other analytic form. Since 
P, is absolutely continuous with respect to Po, but not vice versa, 
H (Po, P1) is continuous for g € (0, 1], and is discontinuous at g = 0. 
Apparently only in the gaussian case does the Kakutani inner product 
or the Hellinger integral reduce to a simple form, and for general stable 
distributions the problem appears to be analytically intractable at 
present. Thus, it seemed worthwhile to investigate numerical methods 
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HELLINGER INTEGRAL H 


for approximating the desired integrals. Again it seems important to 
emphasize that an accurate approximation of the log likelihood prob- 
ability density Laplace transform under H;, or Ho is needed at only a 
small number of choices of g, so the calculations can be quite inexpen- 
sive. In the previous section, the log likelihood characteristic function 
had to be approximated at a great many frequencies, and the resulting 
computation effort and storage made that program relatively more 
expensive. 


4.4 Numerical approximation of performance bounds 


At present, three approaches have been investigated for calculating 
stable probability density functions. The first involves summing power 
series and asymptotic series,*! the second involves quadrature of an 
integral representation of the density,*® and the third uses a discrete 
fast Fourier transform of the characteristic function (Ref. 33, pages 
35-42; and Ref. 34). 

The approach used here was a combination of the first and third 
methods. The stable probability density function was approximated 
over its central region via a discrete fast Fourier transform, while 
asymptotic expansions were used outside this region. This approach 
avoids the difficulty of knowing how to merge the power series and 
asymptotic series (see Ref. 31). 

The Kakutani inner product was broken into two integrals. The first 
integral was approximated by a fixed step size Romberg integration 


1.00 O O O . 
a) 
= WS 
Q O 
eo 
Q 0 
oo O e ~ 42 =05 
H=f af P(x - 8!) p(x - 8°) dx 
— oo Q — U a=0.7 
0.1 Q 
p(x) = p(x; a, B= 0) e “a=0.9 
.) 
e' 
a=1.1 
C) 
Q 
a=1.3 
Y) N. 
0.01 a=1.5 
e 
a=1.7 
a=2 0 
> a=19 
0.001 
0.01 0.1 1.0 10.0 100.0 1000.0 


ere 


Fig. 6—Hellinger integral vs (s! — s°)/e [a = 0.5(0.2)1.9, 8 = 0]. 
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H= f /p{x -s1) p(x -s®) dx 


p(x) = p(x; a, B = 0) 


HELLINGER INTEGRAL H 





=) 


Fig. 7—Hellinger integral vs (s! — s°)/e [a = 1.90(0.01)2.00, 6 = 0]. 


routine*® using the discrete fast Fourier transform approximation to the 
density (typically, 4096 points were used). The second integral was 
approximated by a variable step size Romberg integration algorithm 
using the asymptotic expansion for the density. 

While this approach is adequate for finite mean stable distributions 
(1 < a@ S 2), and with care works for 0.5 S a S 1, it is inadequate for 
0 < a < 0.5, because the expense is too great at present. The reason 
is that forO < a < 1, a great many evenly spaced points must be used 
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to adequately approximate the characteristic function in the neighbor- 
hood of the origin (where its derivative is unbounded), as well as at 
other frequencies, and the expense of storing these values (to carry out 
the discrete fast Fourier transform) is prohibitive. One possible ap- 
proach around this problem is to simply use only the series expansion 
(see Ref. 33). 

All results presented here were calculated on a Honeywell 6070 
computer using double-precision arithmetic (14 significant figures) ; the 
estimated relative error in all cases was less than a tenth of one percent. 

Figure 6 shows the Hellinger integral for various parameters 
[a = 0.5(0.2)1.9, 8 = 0] as a function of [(s! — s8°)/c], for N = 1. 
This figure suggests an interesting conjecture, that the Hellinger 
integral is smaller the closer the characteristic index a is to two, all 
other factors being the same. No proof of this is known, at present. 

Figure 7 depicts results of numerically calculating the Hellinger inte- 
gral for various characteristic indices close to two [a = 1.90(0.01)1.99, 
8 = 0], for N = 1. The singular nature of the gaussian distribution 
(a = 2) is quite evident when compared with that of a = 1.99 or 
a = 1.98. 

Figure 8 shows u(q) vs q for fixed [(s! — s°)/c_]. Again, the closer the 
index is to two, the smaller the inner product. 

Figure 9 presents u(q) vs q for various choices of [(s! — s°)/c], and 
fixed characteristic index a and skewness parameter 8; the larger 
(st! — s°)/c, the smaller H,(Po, P1). 


4.5 Comparison of the performance of the log likelihood decision rule 
(a = 1.95) with a linear decision rule 

It is interesting to compare the performance of the log likelihood 
decision rule with a linear decision rule, when the observations are 
drawn from a nongaussian stable distribution with characteristic index 
near two. To be explicit, it is assumed the observations are 1.i.d. stable 
random variables (a = 1.95, 8 = 0), with my) = 7, = 3 and s! = —s? 
= S chosen for simplicity. The linear decision rule is simply 


N My 
Dy ry S 0. 
t=1 Ho 


This sum is a stable random variable, with parameters (a = 1.95 
B = 0, Ny, Ns‘), assuming H;(7 = 0, 1) is true. The total probability 
of error is equal to the probability of either an error of the first or 
second kind, 

ig E> bg 10 = im 01; 


and can be computed from the series described earlier, or from pub- 
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uA(q) 





0 
0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0 
q 


Seavey as of Kakutani inner product H, vs q [a = 1.1(0.4)1.9, B = 0] 
oe — s°)/c = 10]. 


lished tables.#! This is plotted in Fig. 10 as a function of [(s! — s°)/c] 
for various NV. The same figure includes plots of the Hellinger integral 
upper bound on the total probability of error using the log likelihood 
decision rule. The figure makes it quite clear that the log likelihood 
decision rule, for many cases of interest, has a much much smaller 
probability of error than the linear decision rule. 
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Asymptotically, the total probability of error for the linear detection 
strategy behaves as 


ren o([iSa]). v8 
Pz ~ O[(S/c)-*N*-], 


u(q) 


CHARACTERISTIC INDEX a = 1.90 
SKEWNESS PARAMETER £ = 0.0 





0.2 0.3 0.4 


0.5 0.6 0.7 0.8 0.9 
q 


Fig. 9—Logarithm of Kakutani inner product H, vs g [(s! — s°)/c = 1, 2, 10, 100] 
(a = 1.90, 8 = 0). 
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(OPTIMUM PROCESSING) 
m——— Pe (LINEAR PROCESSING) 


H-[ Jo sty pix 30) ax] 


p{x) = p(x; a, B, y, 5) 


a=195 y=ct 
B=00 8=0 





0.1 0.3 7.0 3.0 10 30 100 
gicesa0 
() 
Fig. 10—Linear processing probability of error and Hellinger integral upper bound 
on nonlinear processing probability of error vs (s! — s°)/c (a = 1.95, B = 0). 





while the probability. of error for the log likelihood detection strategy 
asymptotically behaves at 


Pr = O(e4%), 
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where A = A(a, B, y, S) > 0, independent of N. This simple asymp- 
totic analysis suggests that the log likelihood decision rule has a much 
smaller probability of error than the linear decision rule, for large N, 
which is borne out in Fig. 10. 


4.6 Comparison of the upper and lower bounds and Pz 


It remains to compare the bounds on total probability of error, and 
probabilities of errors of the first and second kind, with the actual 
quantities. None of the bounds employed here are tight, because the 
upper and lower bounds have different exponents. This program is quite 
difficult, and has only been carried out analytically for the gaussian 
case, and numerically for the Cauchy case. The remaining cases can 
be handled numerically following Shannon et al.”5 For simplicity, from 
this point on it is assumed that mp9 = m1 = 3, S' = —S° = 8. 


Gaussian (a = 2, -1 8 S 1) 
Earlier it was shown that 


1 N 
Pr = Pu = Pu = dexfe (WW). 
This can be upper and lower bounded tightly by (see Ref. 20, eq. 
7.1.18) 
Kye-Nel4e? << Py < KyenNetlset, 


where 
_1/wNs Ns? = 4 
Ki =5:(7F + a +8) 7 
K, = 1 VWs Ns? + =e ra 
2 Cc Ce ry 


Since both K;z and K, behave as O(N-}), Pz ~ e7~N#/4?-01ZN(W)1, where 
K,, and Ky, introduce factors of log (N) in the exponent. The Hellinger 
integral bounds are ”” 


1 3? 1 Ns? 
zen (—N 5.) < Ps <5exp (- a) 


By inspection, the exponent in the upper bound agrees with the tight 
lower and upper bound exponent [to within a factor of LN(N)]. The 
Chernoff upper bounds** on Pio, Po. are 


Po S$ expL—N@*(s/c)?] 
or Pio S expL—N(1 — g)?(s/c)?], 


lA 


for some g € [0, 1] 


and for q = 4 these exponents agree with the tight upper and lower 
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bound exponents to within a factor on LN(N). The lower bounds®> are 


Po > texpl—N¢@"(s/c)? — q(s/c)V2N] 
for some q € [0, 1] 
or Py > 4exp[—N(1 — @)%(s/c)? — (1 — g)(s/c)V2N J, 


and for N sufficiently large, the upper and lower bound exponents 
are identical within a factor of O(N-?). 


Cauchy (a = 1, B = 0) 


The real and imaginary parts of the characteristic function of the 
Cauchy log likelihood were calculated numerically at 513 evenly 
spaced frequencies starting at v = 0 from a direct numerical quadrature 
of the (complex) integral 


g(v) = 2 exp (iw In @+sPte )( 2 dx 


(¢@—s?te/)\ar/] (a —s?4+e’ 
v = kdo, k =0, ---, 512 


using an adaptive, step-size, Romberg, numerical integration algorithm, 
with an estimated error of 10-" (all arithmetic was performed in 
double precision). One representative characteristic function is plotted 
in Fig. 11. The stationary-phase asymptotic expression was used for 
frequencies outside of this range. The resulting approximation to the 
characteristic function was multiplied by itself N times, and a numeri- 
cal approximation of the inverse transform of this resulting characteris- 
tic function was calculated, using a fixed, step-size, Romberg algorithm 
for the first 513 frequencies; an adaptive, step-size, Romberg algorithm 
was used for the tail of the inverse transform. The final results are felt 
to be accurate to three significant figures. The results are plotted in 
Fig. 12, along with the Hellinger integral upper bound. Clearly, the 
Hellinger integral upper bound is quite conservative; it is straight- 
forward to check that the Hellinger integral (squared) lower bound is 
too optimistic, from the curves in Fig. 12. 


4.7 Generalizations 


The extensions of the results in this section (as well as the following 
section) to a much wider class of infinitely divisible distributions is 
immediate. Here these extensions are sketched. Elementary arguments 
(Ref. 15, page 540) show that if the Lévy measure of an infinitely 
divisible distribution behaves asymptotically as a power, 1.e., v(X, ©) 
~ O(X-?), v(—«%, —X)~ O(X-9, then Pr[a > X]~ O(X-»*), 
Pr[« < —X]~ 0(X~9), where p, gq > 0. Given a sequence of 1.i.d. 
random variables drawn from such a distribution with one of two 
location parameters, it is straightforward to check that results analo- 
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REAL PART OF CHARACTERISTIC FUNCTION 


IMAGINARY PART OF CHARACTERISTIC FUNCTION 





Hy: r=sltn 
Ho: r=s9 +n 
3! =0.5, 99 =_05 


Hy: r=slt+n 


Ho: r=s0 +n 
s! = 0.5, 8° = -0.5 


0 5 10 15 20 25 
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Fig. 11—Cauchy log likelihood characteristic function. 
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——— %H (UPPER BOUND) 
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5.0x 10 Pe (OPTIMUM PROCESSING) ‘| 


\ 
dx N 
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2.0x 107% n-[ Spe [(X - $)? + C2] [(X +S)? + C2] Jy 
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Fig. 12—Log likelihood probability of error and Hellinger integral upper bound 
for Cauchy (a = 1, 8 = 0) samples vs (s/c). 
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gous to those in this section hold: (2) l(r;) ~ O(r7"'), (cz) the probability 
of an error of the first and second kind, using a log likelihood ratio test, 
is upper bounded by exp(—AWN), (zi¢) using a simple linear test to 
discriminate between hypotheses, i.e., adding up the observations and 
comparing the sum with a threshold, results in the probability of an 
error of the first or second kind behaving as 0(NL’-*), 0(NL’~2), and 
choosing L’ directly proportional to N (as in the gaussian case) gives 
Po, Pio ~ O(N1-?), 0(N1~9), which is much worse than the perform- 
ance of the log likelihood test in this asymptotic sense. 


V. DISCRETE TIME DETECTION OF STABLE MEASURES WITH 

DIFFERENT SCALES 

In this section, one approach is studied for hypothesis testing of 
different scale parameters; since the ideas are quite similar to that just 
developed, the treatment is much shorter. 

One of two sequences of i.i.d. stable random variables is observed 
(under one of two hypotheses, Hy and Hj): 


A, Tr. = sn, 
Ho ry = 8nz Lskesn. 
The observed or received sequence is denoted {nxz}{’, where the {ni}! 
are 1.1.d. stable random variables with known parameters (a, 6, y = 1, 
6 = 0); both s! and s° are known. The a priori probability of H; is 7; 
(7 = 0, 1). The measures induced by {nx} under Ho and H, are 
equivalent for (0< aX 2, —1 S81); it remains to find the 
optimum decision rule, the log likelihood ratio, and characterize it 
performance. 


5.1 Likelihood ratio test 


Before discussing the general case, the three special analytically 
tractable cases are treated. 


Gaussian (a = 2, -1 $B <1): 


Dalz) = mo SS Be 


cay Dales) /8 sg OT (5 a (3 AB 
Uri) = Glos =e sh 4 sl so} |? 
Mas isu 2) a2) al) eee 
> i=l ae . s} 2s! 28° i=1 : 7, : 
The test involves squaring the observations and comparing with a 
threshold; this test 1s the well-known chi-squared test (see Ref. 5, 
pages 163-173). 
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Cauchy (a = 1, 8B = 0): 


pala) =2 (t+ 0 <a<o; 

a palti/st)/st _. (®\ rb (8, 
ry) = In e/a) = in (4) Int (aye 
ay r + rt (8°)? A ; 
_vn (4) - yin r+ (ss oe 


For |r:| « 8°, s!, Taylor scries arguments show l(r;) behaves as 72, just 
as in the gaussian case. However, unlike the gaussian case, where 1(r;) 
behaves asymptotically (|r;| >> s!, s°) as 0(r3), here l(r;) ~ In (s!/s°) 
+ O(r7?); again, large excursions are soft limited, or essentially 
discarded. 


Pearson V (a = 3,8 = —1): 


1 
—— yo tew*# a> 0 
Pr(x) = {es 
0 Hee 


l(r;) i sc a 2 id (r;/s')/s! 


” palri/®)/9 
-{-2™ n (5) - 5. [or — 90] (r: > 0); 
0 (r; < 0) 
A'= — Fin (5) — 5 (s! — s°) ps a =) 2 Li. 


Again, large deviations in r; are soft limited or weighted lightly, since 
asymptotically (7; > s', s°)l(r;) behaves as O0(r;'). 

The remaining cases can be treated in identical manner using the 
power series and asymptotic series expansions for the stable probability 
density function. For (0 < a < 2, -1 < B < 1;a ¥ 1), the important 
points are: (z) for |r:| < s°, s!, the 7th term (6 # 0) in the log likeli- 
hood behaves as r;, unlike in the gaussian case, while for 6 = 0, 
L(r;) ~ 73, (72) for |r;| > 8°, s!, soft limiting of large deviations is used, 
and the log likelihood’s 7th term behaves as a In (s!/s°) + 0(|r:|79). 
Figures 13 and 14 show representative log likelihood ratios for fixed a 
and varying 8, and fixed 8 with a varying, respectively, computed from 
power series and asymptotic series. *! 

The final case (0 < a < 2, 8 = 1, or B = —1) must be handled 
with a little more care. Only the case 8 = —1 is discussed, since the 
other follows immediately. For (1 < a < 2), the first point made above 
is still valid, while the second point is valid only for r; > 0, r; > 8°, s!. 
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For r; < 0, |ri| > 8°, s!, l(r;) behaves as a In (s!/s°) + 0(— |r;|!-2), 
i.e., decreasing with |r;|. For (0 <a < 1), for 7; > 0, 7: « 8°, s}, the 
ith term in the log likelihood behaves as 0(r;-9/!-”). Finally, fora = 1, 
Ll(r;) = Of — exp[(/2) |r:| J} as r;— —. 

5.2 Performance limitations 


The general problem of finding Pz, Poi, and Po for arbitrary stable 
distributions is still open, both analytically and numerically (because 
of expense). The three special analytic cases are treated here, to point 
out the problems that must be overcome in the general case, if one 
attempts to find the log likelihood probability density by transform 
methods. 

Gaussian (a = 2, —1 <8 <1): assuming hypothesis H;(7 = 0, 1) 
true, the Fourier Geaguatorrn of the log likelihood probability density is 


E(e4’|Hy) = (3)" fh = (3 ) 7 jy" 
B (ei |H) = (5 i) {1 + iv 1) - yp 


These Fourier transforms can be inverted: 
0) 2 N/[2 
p(x|Hy) = ( ome cami) ie 
(s°)? Wi 
(s!) (s°) v=A'—N In (%)>0 


ee (Cap i aN 2-1 s! 


-exp( — yet) / TOD) 
p(«|H1) = p(x|Ho) = 0 x= A’ — Nin($ i) < 0. 





Finally, the probabilities of errors of the first and second kind are: 


Py =1—- < (=tiey — (s°)?] a 





(s°)? 
ae (Fst + FEL SA) / raven 
Po = x(a)" 
aP(SsL + git )/ TO 


LP > N In.-(°/ 84) 
Py = 1, Po, = 0 L'<Nin (s°/s?). 
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Fig. 13—Representative log likelihood functions (a fixed, 8 varying). 
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CHARACTERISTIC 
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Fig. 13— (continued) 


Cauchy (a = 1, 8 = 0): the log likelihood characteristic function under 
Ay is 


E (es' | H;) 


I 


{ {25 ($)" be + te + yep ael" 


_o 7 \ 8° 
’ ; ul ie al hed 
{ (/sirhals [+1 5511-1 + (5) ]] ‘ 


where s' > s° was assumed. Stationary phase arguments show that the 
characteristic function decays asymptotically as 0(|v|—*/?). Again, the 
Fourier inversion lemma guarantees that the problem of finding Po: 
is solved. A similar analysis holds assuming H, is true. 

An alternate approach is to compute the Mellin transform of the 
likelihood probability density function; the results are 


weartia) = {($) [agi 1+ (S) |p 
racing = [(S)-a fens (I 


Unfortunately, it is not clear how to invert this transform to find Po 


and Po. 
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Fig. 14—Representative log likelihood functions (@ varying, 6 fixed). 
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A third approach is to convolve the probability density of the log 
likelihood with itself N times; for N = 2, the convolution involves 
elliptic integrals ; successive convolutions are quite formidable. This was 
not investigated further. 


Pearson V (a = 4,8 = —1): the log likelihood characteristic function 
is (assuming now s° > s!) 


E(es'| 1) = {pace /[1 > (5 = 1) \P" 
E(e*4’| Ho) = {eyes /[3 aici (1 7 =)" 


The log likelihood probability density is 


1 = s! ie (N/2)—1p~—(s80—s1/ 81) x N 
p(a Hi) = (==) x é P 2 


A 
, 2. 8° pie (N/2)—1p—(s0—s1/ s0) N 
pa'|H) = (gia) 20Pe r(5 


p(A"|Hs) = p(A!|He) =0 a’ < Zn (s"/s!). 





win (s!/s°) > 0 


The probabilities of errors of the first and second kind are 


2 [( L’'(s° — 81) \X? N_N L’(s! — 8°) N 
a mae) wi (gig tt) /r(F) 


L' > xin (s1/s°) 


2 [ L’'(s° — s!) \V? NN L’(s! — s°) N 
Pum (AGO) n(gg tu) /t (a) 


P49 = 1, Pou = 0 L’ < vin (siya), 


=| 


Again, the general problem is still open analytically, because closed- 
form expressions for stable probability density function are unknown 
at present (except for the three cases covered here). The general 
problem is expensive to tackle numerically at present, because of the 
expense of both calculating and storing the characteristic function of 
the log likelihood probability density, and because of the expense of 
repeating these calculations for many different parameter choices. 
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5.3 Analytic performance bounds 


Apparently only in the three special cases does the Kakutani inner 
product reduce to simple expressions. These results are recorded here, 
while Section 5.4 discusses numerical approximations of these integrals 
for various cases of interest. 


Gaussian (a = 2, -1 5 6 S11): 
Pa(x) = ot —-x<r<om 
0 1 Q 1 l—gq N 
H (Po, Pi) = {f° | z-(3)| | a> (3) ax| ; 
1)—qN N Lo a\ rs 
Hq(Po Pi) = (a) (7 ESE), 
1 0 1 —N/2 
HP, P) = |5(545)| 
Cauchy (a = 1, B = 0): 


pa(a) = + (a +1) -ex<xr<cow 


© 1 q 1 1—q N 
nasty =| [3m (3)} [2e(2) Ps" 
cae (ee | x\ {ef 1 a 
wmf [$n(2)I [de (3) 
_ fs 1 (s')?\. 
= (5) (pati -14+ Gy) 
The Hellinger integral can be evaluated from the tables in Ref. 30 
263.00: 
‘, 1 ils! — 9°) \2 1" 
my(Po Pd = | Fort (1, Ey 


Pearson V (a = 3, 8B = 1): 


lx 


Bly 





1 
—— z-ie-i* >0 
a(t) = \\on he 
0 x<0; 
2 (s!) 9(s°)}~¢ ,s 
HPo P= (area ae) } 
ov 100 \N/2 
Hi (Po, P) = (S35) ° 
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5.4 Numerical approximation of performance bounds 


The methods and checks employed were identical with those used 
in the detection of location for accurately calculating the inner product 
of the two stable probability measures. 

Figure 15 shows »(q) vs g for fixed (s'/s°) and [a = 1.1 (0.4) 1.9, 
8 = 0]. This raises the conjecture that the closer the characteristic 
index is to 2, the smaller the Kakutani inner product. 

Figure 16 shows u(q) vs q for fixed (a, 8) and various values of 
(s'/s°) : the smaller the (s!/s°), the smaller the H,(Po, P1). 

Figure 17 shows H;(Po, P1) for various (a, 8) as a function of 
(s'/s°) ; note that the case a = 2 does not appear to be singular here. 


5.5 Comparison of performance of log likelihood decision rule 
with a chi-squared test 


How does the performance of the log likelihood test compare with 
that of a chi-squared test, in particular for characteristic index a near 2? 
The chi-squared test involves 


i 


AVS 


N © 
ri 
t=1 


my 


The distribution of any one of the r? can be found from the series 
described earlier: 


Vri; a, B, (s')4,5=0] 7 >0 
0 r, < 0. 


1 
p(r3|H) = 4 ary, PL? = 


The discussion now follows from that in Section 4.6, but is not as 
detailed. Using elementary arguments (Ref. 15, pages 268-272), it can 
be shown that if0 <a < 2, —1 < 6 <1, then 


N 
Pr ( S> L'|H) ~ 0(NL-(2!®), 
t=1 


If L’ is set at a threshold which is a fraction of N, then 
Pp ~ O(NMI2) ; 


i.e., the probability of error grows with N, the number of observations. 
For comparison, the upper bounds on Pq, Pio, and Pz for log likelihood 
detection all behave as 0(e-4”), where A depends on (a, 8, s', and s°). 
Thus, the log likelihood test is asymptotically far superior to the chi- 
squared by the above argument. 
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u(q) 


H,: r=s'n 


Hp: r=s0n 
s! = 1/16, s0 = 1 
B=0 





0.0 0.1 0.2 03 04 0.5 0.6 0.7 0.8 0.9 1.0 
q 


wind aoe of Kakutani inner product H, vs g [a = 1.1(0.4)1.9, 6 = 0] 
s!) = 16. 


VI. DISTINGUISHING STABLE PROBABILITY MEASURES WITH DIFFERENT 
CHARACTERISTIC INDICES AND SKEWNESS PARAMETERS 


For completeness, this section touches on the form the log likelihood 
test takes for discriminating between stable distributions with different 
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characteristic indices and with different skewness parameters. Per- 
formance of this test will not be covered here; much of the earlier 
discussion on performance is applicable here. A table in the Ap- 
pendix summarizes the behavior of l(r:) both asymptotically and for 


u(q) 


s! = 1/16 
Hy: r=stn 
Hg: r=s0n 
s0 = 1 
a= 1.90, B=0.0 





‘0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0 
q 


Fig. 16—Logarithm of Kakutani inner product H, vs g [(@ = 1.90, 6 = 0), 
(s°/s!) = 1, 4, 8, 16]. 
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HELLINGER INTEGRAL H 


foe} 
umf J 0(2)50(5)s 
= — —-|— — }dx 
—co Res sl oy 50 


p(x) = p(x; a, B = 0) 


sO = 1 
51 = 1/2, 1/4, 1/8, 1/16 





1.0 2.0 5.0 10.0 20.0 
(sO / 51) 


Fig. 17—Hellinger integral vs (s!/s°) [a = 0.7(0.2)1.90, 6 = 0]. 


[r;| <1, and includes both the results in the Sections 5.4 and 5.5 as 
well as the results of this section. 

One of two sequences of i.i.d. stable random variables with known 
parameters is observed. In Section 6.1, the parameters are (a’, 8, y = 1, 
5 = 0), where 0 < a < a! S 2; in Section 6.2, the parameters are 
(a, B’, y = 1, 6 = 0), where —1 < B° < #! S$ 1 (recall j = 0, 1). The 
special case (a = 1, |8| = 1) is covered in the table in the Appendix 
but not in the discussion here. 


6.1 Distinguishing different characteristic indices 


For —1 < 6 < 1, the measures Py and P, are equivalent, so the log 
likelihood ratio is always finite. The log likelihood test is 


N Ay 
A’ = Dy U(r:) z L’, 
i=1 Ho 


where 


Noes Pulte; a, B, Y 1, 6 0). 
p= 1h ie By = 16 = 0) 
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Two cases arise: symmetric (8 = 0) and asymmetric (6 # 0, —1 < 6 
<1) stable distributions. For the symmetric case, the distributions 
are symmetric about their unique mode, and thus I(r;) ~ |r3| for 
|r;| <1. For the asymmetric case, the modes no longer coincide, and 
Ll(r;) ~ r: for |ri| <1. Recall that for 1 < a < 2, for fixed skewness 
B (8 <0) the mode decreases as a decreases; for 0 < a < 1, the op- 
posite is true. Thus, [(r;:) is the difference of two unimodal functions 
and, in general, should have two points of zero slope. For |r;| > 1, 
l(r:) = 0(—In |r;|), so large deviations are weighted quite strongly. 
Note the log likelihood distribution has its support on whole line, un- 
like the two previous sections, except for (0<a»)9<1S a S 2, 
[8| = 1). 

For 8 = —1, and 1 < a < a! < 2, the measures Po and P: are 
equivalent, and the above discussion follows immediately with one 
exception: for r;>>1, l(r;) = 0(—Inr;), while for |r;| > 1, ri: < 0, 
L(r:) = O( |r; | ao/ao—1) 

For 8 = —1,0 < a < a! < 1, the measures Py and P; are equiva- 
lent. For r;>0, |r:i| «1, U(r) ~re4-@, while for r;>>1L 
l(r;) = 0(—In ri). 

Finally, for 8B = —1, 0 < a& <1 <a! < 2, the measures Py and 
P, are neither equivalent nor mutually orthogonal. For r;> 1, 
l(r;) = 0(—Inr;), while for r; < 0, l(r;) = ©. For 7; >0, i <1, 
L(ri) = O(rz/@-), 


6.2 Distinguishing different skewness parameters 

For —1 < 6° < 6! <1, the measures Py and P; are equivalent, so 
the log likelihood ratio is finite. The discussion follows that of Section 
6.1 exactly, with the difference that if r;>>1, l(r:) = In (Ri/R») 
+ O(r7%), while if |r;| >> 1, 7: < 0, U(r) = In (Li/L0) + 0(|7r:|72).* 

For -—1=6°< 6! <1, 1 Sa < 2, the measures Py and P, are 
equivalent. For |r;| >> 1, r: < 0, l(r;) = 0(|r:|¢/*"), while for r; > 1, 
U(r) = In (Ry/Ro) + O(77-%). 

For -1=6°<p#' <1, 0<a<1, the measures Po and P; are 
neither equivalent nor mutually orthogonal. For 7; >> 1, l(7r:) 
= In (R,/Ro) + 0(77-%), while for 0 <r; «1, L(ri) = O(r9/*). For 
0> 1, l(ri) = ©. 
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APPENDIX 
Asymptotic Behavior of Log Likelihood Ratio 
A.1 Location (8) 


R ep OR 


I(x) = In 
<2 
<a<2,-1<p<1 
<a<2,8= -l 
=1p=-1 


0<a<1,6B=-1 


p(x; a, B, Y 51) 
D(z; a, B, 7’) 50) 


ra +o 
0(zx) 
0(a-) 
0(a-) 
0(2-) 
r—> +o 
O(a) 


59 < b4 


I> — © 

0(x) ) 
0(a-) 

0(—[x|¥—) 
0(—e(r/2)12—811) 

xls 

O(— (a — 6,)¢/e-1), 


A.2 Scale (c) 


x3a,B,y1 = ci,6 = 0 
U(a) = In PEL Faces 

r— +o t—> —o 
a=2 0(2*) 0(22) 
0<a<2,-1<6B <1* aln (e1/eo) + O(a-%) ain (c1/e0) + 0(|x|-2) 
l<a<2,8=—-1 a In (¢1/ceo) + O(z-2) 0(— |x| 2/27) 
a=1,8@=—1 a ln (¢1/¢o) + O(x-#) 0(—e(#/2)I2/e11) 

ro +a zl0 
0<a<1,6B=-1 a ln (¢1/ceo) + O(2-2) 0(—22/e-1), 


A.3 Characteristic index (a) 


U(a) = In Bie Y= Sa 0<a<a 52 
xr—> +o xr —o 
O0O<a<a,=2,-1<B<1 0(—x?) 0(—2?) 
0<a<a,<2,-1<B<1 0(—In z) 0(—In |z]) 
l<a<a=2,8 = —-1 0(—2?) O( |x| 20! ¢0-2) 
l<a<a<2,6= —1 O(—Inz) —_0(|a| 20/«0-2) 
l=a<a=2,8 = —-1 0(—2°) 0(e(*/2)121) 
l=a<a,<2,8 = —1 0(—I1n x) O(e(r/2) 121) 
r>++o 2x10 
0O<a<l<a<2,8= -1 0(—In 2) 0 (20! 20-1) 
0<ao<a=1,68 = -1 0(—In z) 0 (20! 20-1) 
0<a<a<1,8= —1 0(—In z) 0 (a0! @0-1) 


“This excludes the Cauchy (a = 1, 6 = 0), which was examined in the text as a 
special case. 
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2 2 


0 


A.4 Skewness (8) 


p(z;a,B1,y = 1,5 = 0 
(a) = nba Se -1S8.<f151 
r—7> +0 t—-> —o 
<a<2,-1< Bo < 81 <1 In (Ri/Ro) + 0(27-2) In (L/L) 

+ 0(|2|~9) 
<a<2,-1=£) <81 <1 In (Ri/Ro) + 0(a-*) O([2| %/27) 
=1,-l=£<f<1 In (R1/Ro) + O(a-2)  O(e/? 121) 
<a<2,-1=£,1= £1 0(—a2/e-1) 0(|2| #2) 
=1,-l1=6),1= , 0(—e(*/2)121) O(e(*/2)121) 

L> +o zl0 


<a<1,-1= Bo <6. <1 In (Ri/Ro) + O(a-2)  0(a2/27) 


R; = sin 5 (6; — a), tan (10,;/2) = B; tan (a/2) 
j=0,1 


Li sin 5 (6; — a), tan (76;/2) = —B; tan (ra/2). 
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Distinguishing Stable Probability Measures 
Part Il: Continuous Time 


By B. W. STUCK 
(Manuscript received April 1, 1976) 


A sample function from one of two stable, stationary, independent- 
increment processes is observed for a finite time interval. For differing 
location, characteristic index, skewness, or scale, the probabilities measures 
induced by the process under either hypothesis are found to be mutually 
orthogonal. By suitably modifying the Lévy measure associated with each 
probability measure, continuous-time tests for differing characteristic 
indices, skewness, or scale parameters can be posed as nonsingular detec- 
tion problems; distinguishing location remains a singular detection 
problem. For the nonsingular problems, the likelihood functional is 
found explicitly, and performance limitations are determined. As an 
alternative approach, the observed sample function 1s sampled at discrete 
time instants over a finite time interval, and the performance of log ltkeli- 
hood test 1s studied as a function of sample spacing with a fixed, total 
number of observations. 


I. INTRODUCTION 


In this paper, the work begun in Part I on discrete-time hypothesis 
testing of stable probability measures is extended to continuous time. 
In contrast to the earlier work, analytic closed-form expressions are 
found for both the log likelihood functional and Chernoff-type upper 
and lower bounds on various error probabilities for the log likelihood 
test. As in Part I, the singular role played by the gaussian probability 
measure within the family of stable probability measures is em- 
phasized, both in terms of the form of the log likelihood functional and 
the expressions for Chernoff-type bounds on error probabilities. The 
earlier work dealt with observing N samples from a stable process 
with one of two sets of parameters at time instants At apart; here, we 
fix the observation interval at duration 7, and allow the number of 
observations to become infinite while the spacing between samples 
shrinks to zero (V >, At — 0, such that N-At = T). 

Section II briefly reviews some properties of independent-increment 

1183 


processes and infinitely divisible distributions that were touched on in 
Part I. Section III draws on this tutorial material by considering an 
elementary hypothesis-testing problem for discriminating between two 
Poisson distributions with differing parameters.* Section IV briefly 
reviews some work by Newman? and Newman and the author* on 
calculating log likelihood functionals and Chernoff-type bounds on 
error probabilities for the path-space probability measures induced by 
independent-increment processes. These results are used in Section V 
to show if one or more of the parameters of the two stable-probability 
measures (0 < a < 2) differs, then the two path-space measures are 
mutually orthogonal. Section VI develops one remedy to this so-called 
singular detection by modifying the Lévy measure of the two distribu- 
tions to account for the real physical limitation that the process can 
only be observed to within an accuracy intrinsic in all measurement 
apparatus. Section VII considers a different but related issue, where 
the observed sample function is sampled at discrete time instants over 
a finite time interval, and the performance of the log likelihood test is 
studied as the sample spacing shrinks to zero; this allows one to trade 
off the sample spacing, or the rate at which samples are observed, for 
the total duration of the observation interval, or the total number 
of samples.t 

The results developed here are novel in that one can immediately 
ascertain explicit bounds on the performance of the likelihood ratio 
test, while it is not clear how to do this after reading the literature 
(e.g., see Refs. 6 through 10). The method of proof here relies on 
probabilistic semigroup tools or on the explicit nature of the sample 
paths of an independent-increment process, and this appears to be 
novel when contrasted with such approaches as those referenced above. 


I. MATHEMATICAL PRELIMINARIES * 


Let r;(¢) (7 = 0, 1) be a scalar real-valued random process, with 
right continuous sample paths with left-hand limits everywhere de- 
fined. More explicitly, let 7;(¢) be the sum of a deterministic drift 
process, 6,t, and N independent Poisson processes (labeled by &, 
1 =k # WN), where each Poisson process has rate A;, and hops of 
height h;,. In words, r;(t) has simple jump discontinuities of heights 
hj, 1 Sk SN, at random times. The characteristic functional of 


*The results in Sections III through VI were first announced in Proceedings of 
the 13th Annual Allerton Conference, University of Ilinois, Champaign-Urbana, 
Illinois, October 1-3, 1975, pp. 234-239. 

The results in Section VII were first announced in Proceedings of the 1976 Johns 
Hopkins Conference on Information Sciences and Systems, Baltimore, Maryland, 
March 30-April 2, 1976, pp. 151-154. ; 

* See Ref. 1, Section 3.1 and its list of references for much more information. 
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r;(t) is easily seen to be (¢ > s), 
N 
Ee ti(—-ri@)1] = exp {¢ — s) iv + p» Aju (errrik — » |}. 
=1 


If we now pass to the limit of an infinite sum of Poisson processes, 
then the jump amplitudes {h,;,} take on a continuum of values, and 
the characteristic functional becomes (é > s) 


ELettri)—ri0)1] = exp {( — 8) E +f ghent= Yaw || 


where vy is called the Lévy measure associated with the path- 
space measure of r;, and generalizes the rate parameter set {Ajx}; 
(t — s) fuea dv;(u) is the expected number of jumps of r; whose 
amplitude falls in the set A, in a time interval of duration (¢ — s). 
Lévy and Khinchin showed the following remarkable generalization 
of this heuristic development: 


Theorem (Ref. 11, p. 76): Let r;(é) be an R*” valued random process 
wrth independent increments. Then 


B(exp {io7*[rj(t) — rj(s)]}) = exp {¢ =) | ir ~ PTS 


: qwrky 
+f, (exp (tvT®u) — 1 — chim, ) ax(o) || ; 


where 6; © R”, S; is ann X n positive semidefinite matrix, and 


urky 
io Latha ao ureu dv;(u) <0, 


In words, any independent increment is the sum of three independent 
processes: (7) a purely deterministic drift process, completely specified 
by 6;, (¢2) a purely nondeterministic gaussian process with zero drift 
and almost surely continuous sample paths, specified by S;, and 
(77) a purely nondeterministic jump process with zero drift, a 
sum of independent Poisson processes with different rates and jump 
amplitudes, specified by v;. 

Historically, the mathematical study of independent increment 
processes concentrated first on the purely gaussian case (v; = 0); then 
on the purely stable case (S; = 0, dv; = du(@)dr/re, 0O< a < 2, 
where u is a positive measure on the unit sphere in R” and [7, 6] are 
polar coordinates in R”); and lastly on the general case, building on 
the insight gained in the first two cases.” A second reason for wishing 
to study the gaussian and stable (0 < a < 2) probability measures is 
that they arise naturally from studying limiting distributions of 
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suitably scaled and translated sums of independent, identically distrib- 
uted, random variables in the central limit theorem, and have found 
application in modeling noise in communication channels such as 
telephone lines.’ These two reasons, as well as others, provide the 
major impetus for the study to follow. The richness of the structure 
of independent increment processes suggests they may find more and 
more application in model building as their properties become more 
widely known. 


Ill. DISTINGUISHING POISSON PROCESSES 


In this section, 7;(¢) (j = 0, 1) is observed on the interval [0, T), 
and is the sum of a purely deterministic drift process (specified by 6,) 
and a purely nondeterministic Poisson process (specified by rate \; and 
jump amplitude A;). What is the log likelihood functional, and what 
is its performance? 

First, suppose the Poisson process has the same jump amplitude 
under either hypothesis, but the drifts differ. Then it is straightforward 
to show that the two probability measures Py and P1, associated with 
r; under hypothesis H;, are mutually orthogonal, so (z) observing r 
over any finite interval, the log likelihood functional takes on the 
value + © if H,is true, — © if Ho is true, and (zz) the probability of 
incorrectly choosing one hypothesis when the other is true is zero. 
The reason for this is clear on physical grounds: the Poisson component 
has constant sample functions with simple jump discontinuities at 
random times, while the drift process is continuous with constant 
slope. Thus, ignoring the jumps in the observation process, the slope 
of the continuous part of the sample path is 6;, and to discriminate 
between the two hypotheses is now trivial. From this point on, there- 
fore, it is assumed 6; = 69 and, without loss of generality, set 
6; = 0(7 = 0,1). 

What if the Poisson processes have different jump amplitudes? As 
soon as one or more jumps occur, it is possible to discriminate per- 
fectly between the two processes, since the size of the jump h; is 
associated with hypotheses H;. To avoid this indeterminancy, it is 
assumed from this point on hi = ho = 1. Thus, Po and Py, the 
probability measures associated with H;, are mutually absolutely 
continuous. 


Lemma 1: Let r; be as just defined. Let 


Pi | ( 42+ 7) 
nN 


denote the conditional probability of r; at time [(k + 1)/n]T, given vr; 





(ir Cees heer 
n } 
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at time (k/n)T. Then [r;(0) = 0 as.; ‘ = 0, 1], 


Gk In Se (”) = lim bi m= (0), 


(it) Hq(Po, Pi) = lim TH, oe . 
Proof: The proof follows from standard limit theorems (Ref. 7, Lemma 


1.1), Q.E.D. 
We now explicitly evaluate the limits in Lemma 1: 


Proposition 2: Given the conditions of Lemma 1, 


(i) A= [O |-@.-r0 + & fin (® *)] au — uw 
-[" | — Or — agate + n ( *) are], 


where NV, is the a.s. finite number of time instants {t,} where 
r, changes state. 


(i) Hg(Po, Pi) = exp {—T[q1 + (1 — @)do — AfAo* ]}}. 
Proof: 


(<) Given r; at time (k/n)T, it will remain in that state in the next 
time interval (7’'/n) with probability 1 — \;T/n + o(T/n), and 
will increase by one with probability \;(T/n) + o(T/n). The 
desired result now follows Lemma 1. 

(2c) If r; changes its state in the next time interval of duration 
(T/n), then 


tA ? T 
Hq(Po, Pi) = 7 AfAo * + 0(T/n); 


while if r; stays in its present state in the next (T'/n) time 
units, then 


H,(Pi, Pi) = 1 ~~ [ga + (1 — gra] + 0(P/n). 


¢ , T 
Hy(Ps, Pi) = exp | — 7 di + (1 = @)do — 910579 | 
+ o(T/n), 
A,(Po, P;) = exp {-T[@.1 7 (1 =, Q)Ao = Pro" HI}, 


where the last step follows from Lemma 1. 
Q.E.D. 


Recall from Part I that a crude bound on the total probability of 
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error Px for a log likelihood ratio test is provided by 
+ min (m1, mo) Hi < Pe =< V%1, moll ;, 
where 7; is the a priori probability hypothesis that j is true. Here, 
Hy = exp [—T(Va1 — VA0)?/2] 
and hence for fixed 7’, one would like to have the difference in the 
square roots of the rates as large as possible. 

To gain further insight into H,(Po, P1), we rewrite it as the ex- 
pectation of a third Poisson process. Let 2,(t) be a Poisson process 
with rate AfAo~%, hops of height +1, and 2,(0) = 0 as. (intuitively, 
the probability measure P, associated with x, has support on the 
common support of P» and P1). 


Proposition 8: 


Hy P) = i aPwce | - [ , DLe4(t) | 


= Ba, (exp {- ie Div4(0 et 


D (aq) = Qd1 + (1 — go — APNG. 


Proof: The proof follows from the definition of D, P,, and 2g. 
Q.E.D. 


To the best of our knowledge, this result is new, and will be generalized 
in the following section and elsewhere.** Its significance lies in the 
fact that there exists a large body of results in both the mathematics 
and physics literature for studying properties of expectations of 
multiplicative functionals of random processes, so called Feynman-Kac 
functionals; now we can immediately draw on this body of knowledge. 


IV. DISTINGUISHING INDEPENDENT INCREMENT PROCESSES 


In this section, the results of Section III are extended to arbitrary 
independent increment processes. Here, r; € R” is observed over 
[0, 7’), and has one of two sets of parameters (6;, S;, vj) (7 = 0, 1). As 
before, define for 0 < gq <1, 


Pp. py = (aP1\* ( aPo\ 
diig(Po, P:) = (GF) (5) du, P1, PoK pu 
H,(Po, Ps) = [ dha(Po, P2), 


where H, is the Kakutani product associated with Po, P1. Next, it is 
useful to define a nonnegative measure jg(vo, vi) [the generalization of 
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the point measure at +1 with mass g\1 + (1 — g)Ao — AfAQ7! in 
Section III], 


djq(Y0, v1) = gdvy sh (1 = qg)dvo = dhg(vo, v1); 
J (vo, v1) = [ dia(r, v1); 


and J, is nonnegative and may be infinite, since vo or v1 or both may 
not be finite measures. If Jz < ©, it is convenient to define 


ry dv; (u), 


°i Fee 1 + Tha 
gb: + (1 @)d0— fo pam, Sia, m0). 


bq 


Finally, if Sy = So = S, and Jz <~, a third independent increment 
process 2x,(t) is defined with parameters [6,, S, hq(vo, 71) ]. 


Theorem 4: For Po and P, not to be mutually orthogonal, it 1s necessary 
and sufficient for the following three conditions to hold: 


(2) J q(vo, v1) <0 
(22) Si=8=S20 
(277) bg € range (S). 


If these three conditions are satisfied, then 


GAGs i 7 ie - In Waire= = (it dv)at| 
+ 8288 Lrp — jo — (do + 41) 1, 


where d,r; assigns a point mass at time instants where 7; — ri_ = 4, 
i.e., where r; hops with amplitude wu, and 7; is the Jump process com- 
ponent of r;. 


(by He pP ie e | — 1a, n) — Fal - dag", | 


Proof (sketched)*: The proof is broken into two parts, one part dealing 
with the jump process, the other with the gaussian process (including 
drift). The part dealing with the gaussian component is classical,® and 
yields conditions (77) and (222), above. The main method employed 
in showing condition (7) for the jump-process component is to ap- 
proximate the jump process by a sum of independent Poisson processes 
with different rates and jump amplitudes. As more and more Poisson 
processes are included in this sum, it can be shown that the approxi- 


“From a detailed proof in Ref. 5. 
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mation converges weakly to the actual jump process. The Kakutani 
inner product of the probability measures of the approximations is 
simply the product of the Kakutani inner product associated with 
Poisson processes of the same jump amplitude (but possibly different 
rates); again, the delicate part of the proof is to show this approxi- 
mation converges to the actual Kakutani inner product of the path- 
space probability measures of the two independent increment processes. 

The program is to use this theorem in the remainder of this paper 
to exhibit the log likelihood functional and ascertain bounds on its 
performance in hypothesis testing for stable processes. Skorokhod’~* 
has obtained conditions (77) and (cz) in Theorem 2, and instead of 
condition (z) obtained two conditions which must hold: 


f (g —1)dy << and (g — 1)dv <~, 
lo—-1|>3 |o—3] 54 
where g = (dv:/dvo); it is easy to show these two requirements are 
equivalent to J4(vo, v1) <©. Hence, these conditions appear simpler 
than those of Skorokhod. Moreover, it is obvious how to use J, to 
determine performance limitations, while it is not obvious at first 
glance how to apply Skorokhod’s work. Also, the method of proof is 
different and may be easier to follow. 

Finally, it is instructive to rewrite H, as a Feynman-Kac type of 
functional of 2: 


Proposition 5: Let 2, be a stationary independent increment process 
with parameters (6,, S, hg) as defined previously. Then, 


H,(Po, P:) = Ex | exp (- i ‘ D(e,)at) 
= [ aP, exp |- [Dena 


D(a) = q(L — gQar"s3,/2 +f die 


where 


Proof: The proof follows immediately from the definitions of D, x,, Py. 
Q.E.D. 


Again, note that 


D(z;) = 5 | (82"/2)S-*(/2) ae I. (2 z ey au | 1, <p 


can be immediately used to provide a crude upper and lower bound on 
the total probability of error. As in the Poisson case, one desires the 
differences in the square roots of the Lévy measures (suitably defined) 
as large as possible, for good performance. 
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V. DISTINGUISHING STABLE PROCESSES 


Let 2;(t) (j = 0, 1) be a scalar real-valued representation of a stable 
(0 < a < 2), stationary, independent increment process, t € [0, T), 
with characteristic functional [2;(0) = 0 as.; 7 = 0, 1].* 


Ei (e#?2i(t)) = exp { E ey (cr = ; 3) arse) |} 





Gh vy =cLl[ul“7 u<0 
aS vy = —chu-ti u>O. 


Following Section IV, it is clear that Jq(vo, v1) diverges (to +), 
from simply substituting in the explicit form for vj and carrying out 
the calculations. Hence, Po and P; are mutually orthogonal if one or 
more of the parameters differ, the log likelihood functional is either 
+0 or —© on hypothesis one or zero, respectively, and the prob- 
ability of incorrectly choosing one hypothesis when the other is true 
is zero. 

Since J,(vo, v1) diverges because »; diverges as |u| — 0, this suggests 
that being able to observe the process perfectly, down to jumps of 
vanishingly small amplitudes, may be the mathematical reason for 
singular detection; but therein lies the flaw: it may well be physically 
impossible (the mathematical model is inadequate) to achieve this. 
Frost” apparently first popularized this idea in the engineering litera- 
ture; here we reach the same conclusions by entirely different methods. 
Sections VI and VII deal with two distinct methods for overcoming 
these flaws in the mathematical model. 


Vi. DISTINGUISHING PSEUDO-STABLE PROCESSES 


Let 2z;(t) (j = 0, 1) be a scalar real-valued representation of a 
stationary independent increment process just as in Section V, except 
that the Lévy measure is now written as 





ch |u|~a u<—L 
_ JAF(u) -L<u<0 
vj(u) = v4 (u) O<u<R 
—chu-ti RR <4, 
where 
ee aay Sees oe ie 
i ie |. Tee aa 
and 


, U 
6; = 6; ~ fo. To pe Wi). 


* The case a = 2 is well known®® and, for brevity, is not included here. 
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AL, AL are absolutely continuous with respect to Lebesgue measure, 
and y; is nondecreasing on (— ©, 0—), nonincreasing on (0+, ©). 
The limit, as both R and L approach zero, of a sequence of such pro- 
cesses can be shown to converge weakly to a stable process, and hence 
these processes are christened pseudo stable processes. Here L and R 
quantify that the fact that no negative jumps can be observed with 
amplitude less than L, no positive jumps can be observed with ampli- 
tude less than R. Both the properties of the sample functions and the 
one-dimensional distributions are radically different here from stable 
processes: (7) pseudo-stable process sample functions are of bounded 
variation w.p.l, with only finitely many nonzero jumps in any finite 
time interval; stable process sample functions are of either unbounded 
(1 <a < 2) or bounded (0 < @ < 1) variation w.p.1, with the set 
of time instants at which nonzero jumps occur being dense in any 
finite time interval, and (77) the set of one-dimensional distributions 
of pseudo-stable processes is clearly not closed under convolution, 
which was the defining property of stable distributions, but the 
asymptotic tail behavior is the same, since 


Pr [z;(¢ = 1) > x] ~0 @2 dv,(u)) ) 
Pr [a(t = 1) <-az] ~ 0 ce dv,(u)) 


For this special case, it is straightforward to show that Jq(vo, v1) <®, 
and hence condition (2) of Theorem 2 is satisfied. However, 6, is not 
in general in the range of S(=0), and again singular detection is 
possible. The reason is clear on physical grounds (ef. Section III, the 
Poisson case): the slope of the sample paths of 2;(¢) is 6;, ignoring the 
jump discontinuities, and hence it is trivial to discriminate between 
two pseudo-stable processes with different drifts. Two approaches are 
available: either let S be nonzero, which we do not pursue here because 
this seems ad hoc, having introduced L, R, already, or make the drifts 
match, 6; = 59, which we assume from this point on. 

The log likelihood functional is thus 


A(r,) = A at i In (dvi/dv)dre + i In (drs/dv ar. | 
+f" dt ta (—dr1 + dio) + de (—dv. + a) | 


where, for simplicity, it was assumed 
AL = ci D4, 04 =—ch R-4. 
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As expected, the form of the log likelihood functional is quite sensitive 
to whether a = 2 or0 <a < 2 (e.g., see Refs. 7-10 for a = 2). 

To obtain upper and lower bounds on the probabilities of an error 
of the first or second kind, and on the total probability of error, the 
Kakutani inner product H, must be calculated. Assuming AL = cL, 
\4. = —c4 R-%i, the result is 


Hf4(Po, Pi) = exp [—TJ (vo, v1) _], 
Jq(vo, v1) = q(cLL-™ + ch R-™) + (1 — g) (cL L-* + cf Ro) 
= (aoc?) 4(arrc)4 L7aa1- A-@) eo 
gai + (1 — g)ao 
= (aocy.)**(arrc4) 9 PR-aa1- 4a) a0 
gai + (1 — J) Xo : 


H,(Po, Ps) = Es, |exp | - i "D(a)at || . 
D (Gq) = ds 


In summary, discriminating between Wiener processes (a = 2) with 
different variances leads to singular detection, while if the variances 
are ldentical then the detection problem is nonsingular.*” Discriminat- 
ing between stable processes (0 < a < 2) with one or more different 
parameters leads to singular detection. If the Lévy measure is modified 
to be a finite measure, then if the drifts differ, singular detection occurs, 
while if the drifts are identical, then the detection problem is 
nonsingular. 


Vil. DISTINGUISHING SAMPLED STABLE PROCESSES 


The previous sections show that it is quite easy to find examples of 
continuous time singular-detection problems. In this section, it is 
assumed that N samples of a stable process with one of two sets of 
parameters are observed, and we wish to study the effect of choosing 
the sample spacing and the total length of the observation interval 
on the Kakutani inner product H,; the goal is to make H, as small 
as possible. 

Attention is confined solely to scalar processes from this point on. 
The distribution of x,;[ (k + 1) At] — 2;(kA?) is given by P;(At; 6;, S;, »;). 
The Kakutani inner product of the new two discrete time distri- 
butions is 


He(T, at) = | f (aPs/du)e(aPo/du)'-edu | 


For At|0 or To, with (7/At) = N fixed in both cases, fixed, 
that H, can approach one, some number between zero and one [say 
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e-*N, where k = k(61, 50, Si, So, v1, v0) J, or zero. It is obvious that 
if the two continuous time independent increment process path-space 
measures are not mutually orthogonal, then the only approach to 
reducing H, is to fix At and increase 7. However, if the two con- 
tinuous time independent increment processes have mutually orthog- 
onal path-space measures, then it is possible to reduce H, by decreas- 
ing At with (T7/At) fixed. To state the result, a lemma is needed: 
Lemma 6. If uw is an infinitely divisible probability measure, with 
v(u) ~ 0(|ul-*) as |u| 0,0 < a < 2, then 


fex (ivr)du(x) = exp[—S|v|* + D(v)], 0<aS2, 


where if 
u 
as er ee dv(u) = 0, 

then 

Lim D(v)/|v|* = 0, 0<aS2; 

| v| 00 
otherwise, 

D(v) = ivs + D’(v), Lim D’(v)/|v|* = 0, Lo =-2; 
| v| +90 


Proof: The proof follows from properties of v, and is found in Ref. 5. 
Q.E.D. 


The main result can now be stated: 


Proposition 7: For 0 <q <1, with a zero-drift gaussian component 
(a = 2) present in either x1, or xo, or both, if (T7'/At) is fixed 


Lim H,(T, At) = 
A tho 


(a) 1 iff S; = So > 0. 

(b) Exp (—kN) iff Si¥% So, Si > 0, and Sp > 0, 
k = In [g(So/Si)*-4 + (1 — g)(S1/So)2}. 

(c) 0 iff Si 0 = So or Si =0+ So. 


If a zero-drift nongaussian stable (0 < a < 2) component is present 
in either r; or 7, then 


Lim H,(T, At) = 
Attlo 


(d) 1 iff a1 = &o, Si — So. 
(e) Exp (—kN) iff a1 = ao, Si* So, Si > 0,1, So > 0. 
(f) O iff a1F~ ao. 
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Proof: The proof follows from scaling arguments, and is found in 


Ref. 5. 
Q.E.D. 


If a gaussian component is present in both processes, then decreasing 
the sampling interval has no effect on decreasing H,, and T must be 
increased to decrease H,. However, if no gaussian component is present 
in one or the other of the processes, or if a1 ~ ap, then it is possible to 
decrease H, by decreasing At with (7/At) fixed. 

Analogous results for 7’7— © with (7'/At) fixed are presented in 
Ref. 5, as well as some results on the rate at which H, approaches its 
limiting value. 

Related work on nonuniformly sampling a continuous time inde- 
pendent increment process with one of two drift parameters is available 
in the literature.“ A typical result is that sampling two stable processes 
with identical characteristic index, skewness, and scale, but differing 
drifts, is a singular detection problem if 


pF a 


3=0 


diverges, where {t;} are the sampling epochs, 
> (tiz1 — ty) = T. 


Thus, spacing the samples apart by tj41 — t; « j-™(m > 1) results in 
singular detection, but (¢;41 — t;) « e~™4(m > 0) may not. 


Vill. ACKNOWLEDGMENTS 


The author gratefully acknowledges many enlightening conversa- 
tions with P. A. Frost concerning continuous time detection problems. 


REFERENCES 


1. B. W. Stuck, “Distinguishing Stable Probability Measures, Part I: Discrete 
Time,” B.S.T.J., this issue. 

2. C. M. Newman, ‘“‘The Inner Product of Path Space Measures Corresponding to 
Random Processes With Independent Increments,’’ Bull. Amer. Math. Soc., 
78, No. 2 (March 1972), pp. 268-272. 

3. C. M. Newman, ‘On the Orthogonality of Independent Increment Process,’ 
Topics in Probability Theory, ed. D. W. Stroock and S. R. 8. Varadhan, 
aera oo of Mathematical Sciences, New York University, 1973, 
pp. 93-111. 

4. C. M. Newman and B. W. Stuck, “Chernoff Bounds for Discriminating Between 
Two Markov Processes,’’ unpublished paper. 

5. C. M. Newman and B. W. Stuck, ‘“‘On the Orthogonality of Path Space Measures 
of Infinitely Divisible Processes,’’ unpublished paper. 

6. R. H. Cameron and W. T. Martin, ‘‘The Behavior of Measure and Measurability 
Under Change of Seal in Wiener Space,’’ Bull. Amer. Math. Soc., 53, No. 3 
(June 1947), pp. 130-137; ‘Transformations of Wiener Integrals Under 
Translations,’ Ann. Math., 45, No. 3 (June 1944), pp. 386-396. 


PROBABILITY MEASURES—II 1195 


11. 
12. 


13. 
14. 


. A. V. Skorokhod, ‘On the Differentiability of Measures Which Correspond to 


Stochastic Processes: I. Processes Independent Increments,’’ Theory Probab. 
Appl., 2, No. 2 (April 1957), pp. 407-432. 


. A. V. Skorokhod, Studies in the Theory of Random Processes, Reading, Mass. : 


Addison-Wesley, 1961. 


. I. I. Gikhman and A. V. Skorokhod, ‘On the Densities of Probability Measures 
10. 


in Function Space,”’ Russ., Math. Surv., 2/, No. 1 (January 1966), pp. 83-156. 

P. A. Frost, ‘Some Singular Estimation and Detection Problems for Condi- 
tionally Independent Increment Processes,’’ Proceedings of the Ninth Annual 
Allerton Conference, University of Illinois, Champaign-Urbana, Illinois, 1971, 
p. 232. 

B. V. Gnedenko and A. N. Kolmogorov, Limit Distributions for Sums of Inde- 
pendent Random Variables (revised), Reading, Mass.: Addison-Wesley, 1968. 

8. J. Taylor, ‘‘Sample Path Properties of Processes With Stationary Independent 
Increments,”’ in Stochastic Analysis, ed. D. G. Kendall and E. F. Harding, 
New York: Wiley, 1973, pp. 387-414. 

B. W. Stuck and B. Kleiner, “‘A Statistical Analysis of Telephone Noise,” 
B.S.T.J., 68, No. 7 (September 1974), pp. 1263-1320. 

L. A. Shepp, ‘Distinguishing a Sequence of Random Variables From a Translate 
of Itself,’ Ann. Math. Statist., 36, No. 4 (August 1965), pp. 1107-1112. 


1196 THE BELL SYSTEM TECHNICAL JOURNAL, OCTOBER 1976 


Copyright © 1976 American Telephone and Telegraph Company 
THE Bett SystEM TECHNICAL JOURNAL 
Vol. 55, No. 8, October 1976 
Printed in U.S.A. 


Alarm Statistics of the Violation 
Monitor and Remover 


By G. S. FANG 
(Manuscript received March 1, 1976) 


Four of the Bell System digital transmission systems, T1 Outstate, 
T2, 83A-RDS (radio system at T8 rate), and T4M, have violation monitor 
and removers (VMRs) located at the recewving-end maintenance offices. 
Among other things, they monitor the lines, remove violations in the pulse 
transmission code, and generate alarms to initiate maintenance actions. 
This paper investigates the alarm statistics of the four types of VMR under 
the assumption that the information bits are statistically independent. It ts 
found that all the VMRs have very sharp alarm thresholds. The results 
of the T£4M VMR are presented in detail. Curves are given to show the 
various statistics obtained. 


I. INTRODUCTION 


Digital transmission systems serving large numbers of message 
channels should be continuously monitored to check the quality of 
service. This can be achieved by putting monitors at maintenance 
offices along the digital transmission route. An ideal monitor should 
provide the exact number of errors made in transmission. Since line 
errors cannot be directly measured in service, alternative criteria have 
to be used for performance monitoring. For instance, bipolar coding! 
can be employed so that the monitor can detect line errors from the 
violations of the coding sequence, and parity bits can be inserted into 
the transmitted digital stream so that the monitor can detect line 
errors if the received parity bits differ from those calculated from the 
received signal. The monitor generates alarms to initiate maintenance 
actions when the detected violation rates are greater than a prede- 
termined threshold. 

In some cases the digital stream has a periodic, identifiable pulse 
sequence called ‘frame format” to which the monitor must synchronize 
before it can detect violations. The monitor is said to be in-frame when 
it recognizes the location of the frame-pulse sequence. High line-error 
rates may alter the frame pulses such that they are unrecognizable by 
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the monitor, which is then said to be out-of-frame. The in-frame 
condition is necessary to identify the various signal components that 
are multiplexed to form the digital stream. The monitor removes all 
violations detected so that violations do not propagate beyond the 
maintenance office; hence, the name ‘‘violation monitor and remover” 
(vMR) was designated. Removal of a violation is not an attempt to 
correct the line error. It is perfomed to guarantee that the vmr output 
is violation-free so that if an alarm condition exists, it will not propa- 
gate to the next maintenance office. The vmr performs other functions 
as well. If it is out-of-frame, a pseudorandom signal with proper frame 
format will replace the received digital stream at the vmMr output in 
order to prevent alarm propagation. 

Four of the Bell System digital transmission systems, T1 Outstate, 
T2, 3A-RDS (radio system at T3 rate), and T4M have vars located 
at the receiving-end maintenance offices. The T1 Outstate system uses 
bipolar coding. The T2 system utilizes B6ZS! (bipolar with six zeros 
extraction) coding. Both the 3A-RDS and the T4M systems employ 
added parity bits for performance monitoring. The vmr for each system 
has its own alarm rules. The durations of time for alarm generation 
and alarm release at various error rates are important system param- 
eters. This paper investigates the alarm statistics of the four types of 
vMR under the assumption that the information bits are statistically 
independent; i.e., each bit is a Bernoulli trial. The derivations for the 
T4M vr? are presented in detail in Section II. Those related to the 
other vmrs are discussed in Appendix A. Section III discusses. some 
of the results obtained and their significance in digital transmission 
systems. 


ll. THE T4M VMR 
2.1 Alarm strategy 


The T4M digital transmission line? has a transmission rate of 274 
megabits per second (Mb/s) with the information transmitted in a 
binary format. Its frame format? contains 196 bits of which 192 are 
information bits and 4 are housekeeping bits. One of the latter is a 
parity bit used to check the 192 information bits. The alarm strategy 
of the vmr at low-parity violation rates is implemented in the following 
manner. The first single parity violation that is observed triggers a 
100-ms timer and a counter. If the counter accumulates more than 31 
parity violations before the 100-ms measuring timer times out, a 
3-ms waiting timer is immediately triggered. At the end of 3 ms, 
another 100-ms timer is triggered and the counter starts counting 
again. During this second 100-ms period, if the counter overflows; i.e., 
it accumulates more than 31 violations, a vmMR alarm is generated 
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immediately. The 3-ms waiting timer is employed so that a short 
burst of errors will not cause an alarm. Since the transmission rate is 
274 Mb/s and violation is checked once every 196 bits, the alarm 
threshold violation rate is set at 


31 X 196 


ne ee —3 
D74 X 108 x Oa ~ 9-222 X 10. 


It will be shown in the next section that this violation rate corresponds 
to an error rate of approximately 1.1 X 10-°. 

To avoid oscillatory alarms near the threshold violation rate, 
hysteresis is designed into the vMr alarm system. A 1-second release 
timer is used to measure the violation rate when the vr is in the 
alarm state. The release timer is free-running and is not synchronized 
to the vmr alarm. The alarm is released only after a full duration of 
the release timer is passed and the 31-violation counter does not 
overflow. Thus, whenever an alarm is generated, it will last at least 1 
second. This produces a release-error-rate threshold of about 1.1 X 1077. 

When the var is out-of-frame for 0.5 ms, a pseudorandom signal 
with the proper frame format is switched in to provide a violation-free 
output. As soon as the vr is back in-frame, the violation counter is 
reset and starts counting until the 1-second free-running release timer 
times out. If the counter does not overflow, the pseudorandom signal 
is then switched out. Thus, after a failure is restored, it takes anywhere 
from 0 to 1 second to switch out the pseudorandom signal. 


2.2 Bit error rate versus parity violation rate 


Since the digital transmission line performance objective is usually 
set in terms of the bit error rate, which cannot be directly measured 
in service, it is desirable to establish the relationship between the 
parity-violation rate and the bit-error rate. Let 1 be the number of 
information bits contained in each parity check. Then, 

P{parity violation} = P{odd number of bit errors in J bits} 
-P{the parity bit is correct} 
+ P{even number of bit errors in J bits} 
-P{the parity bit is in error}. (1) 


In what follows all random variables are in boldface type. Let the 
bit-error rate and the parity-violation rate be represented by «and , 
respectively. For each realization of ¢, (1) can be written as 


v= (1 — b,)(1 — «) + die, (2) 


where b; denotes the probability of having an even number of bit 
errors in 1 information bits. This event occurs if a correct first bit is 
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followed by an even number of bit errors or if an incorrect first bit is 
followed by an odd number of bit errors. Therefore, for / = 1, 


by —_ (1 =: €)bi-1 + e(1 sages bi-1), bo = 1. (3) 


Define the generating function 
B(S) = 2 bS' ~-1<S8 <1. (4) 
i=0 


Multiplying (3) by S' and adding over / = 1, 2, ---, we obtain 


B(S) — 1 = (1 — &™)SB(S) + S(1 — 8S) — &SB(S) (5) 
or 


B(S) = 3{(1 — 8)*+ [1 — (1 — 26e)ST"}. (6) 


Expanding into geometric series, we get 


— Je)! 
—_ At = 20) (7) 
which is equivalent but preferable to 
b= w) (1 — 6) + fe) e(l — 6)? +---, 
Substituting (7) into (2) 
= (1 = 96)! — 2)! 
p = 1 On? x 4 E 8 xe (8) 


Equation (8) establishes the relationship between the parity-violation 
rate and the bit-error rate. When le < 1, it is easy to see that 


vx (l+ le. (9) 
In the T4M frame format, / = 192. Therefore, 
v 1936. (10) 


Equation (10) is intuitively obvious because only errors occurring in 
the 192 information bits and the parity bit are counted by the var. 
Since a parity check is made every 196 bits, let «’ = v/196, e’ can be 
considered as the measurable bit-error rate. It differs from ¢ by about 
1.5 percent when (10) holds. 

Figure 1 plots the parity violation rate versus the bit error rate 
based on (8) with / = 192, assuming the vmr stays in frame. We see 
that for bit-error rates below 10-%, there is almost a one-to-one corre- 
spondence between a bit error and a parity violation. Above 107%, 
the vmMR may go out of frame. 
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Fig. 1—Parity-violation rate vs bit-error rate. 


In this paper, only low-parity-violation rates are being studied. 
Thus, ¢ will be used in place of ¢’ for simplicity. 


2.3 Statistics of the alarm interval 


Let y represent the transmission rate, N the number of independent 
violations incurred, and T the time spent to count the violations. By 
the Bernoulli trial assumption, 


Pin Sal? ote Sey= ee en(1 — evn, (11) 


In this paper, only conditional distributions are discussed in most 
cases. For simplicity, conditions such as e = e, N = n, and T = é are 
not expressed explicitly when they are understood. 

Since yt is large, by De Moivre-Laplace limit theorem, a normal 
approximation to the binomial distribution is applicable. 


PIN & nj wv — 4 (eae) (12) 
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Fig. 2—Probability of counter overflow vs bit-error rate and time. 
where 
(2) = f° ee #ra (18) 
() = [7 Aeeora 1 
—o 20 
is the cumulative normal distribution. Let 
w= PIN 2 n}, (14) 


pis the probability of counter overflow given an error rate and a fixed 
timer. Figure 2 shows, on a probability scale, this probability as a 
function of the bit-error rate for tj = 100 ms. The same curve with a 
different ordinate also shows the probability as a function of time for 
e = 10-*. It can be seen that when the error rate varies from €)/2 
to 29, the probability of counter overflow varies from 0.0001 to 0.9999. 
Thus, the threshold is very “hard.” 

Let M be the random variable such that the vmr alarm is generated 
at the Mth measuring period. Each period is 100 ms if the counter 
does not overflow. It is desirable, then, to determine the probability 
Dm, m = 0, 1, 2, ---, that the vmr will generate an alarm at the mth 
measuring period, given T = ¢ and e = e. If 1 represents the event 
that during a measuring interval the counter overflows and 0 repre- 
sents the opposite, the m periods must be of the form 


MX KX OA 
~~ ———qw~Y 


m—3 
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where the sequence of m — 3 Xs does not have any 1 1 pair within it. 
Hence, 


pn = P{M =m} (15) 
= P{1 1 does not occur in a sequence of length m — 3}-p{011} 
m—3 
= (1 — 2s, p.) (1 — p)y?. (16) 


By definition, po = pi = 0, pe = p?, and p3 = (1 — u)p?. Denote 
dm = P{M > m} 


=l]- = Di; (17) 
i=0 


then 
Dm = Qm—3(1 — y)y?. (18) 


Define the generating functions as 


P(S) =  pSt = -1 S881 (19) 
Q(S) = Ds aS* ~1<S8<1. (20) 

Then, 
(1 — S)Q(S) = 1 — P(S), (21) 


as can be seen by comparing the coefficients of any S* terms on each 
side. From (19), (18), and (20) 


P(S) = #S? + (1 — »)uS*Q(8). (22) 
Equations (21) and (22) give 


wS?(1 — pS) 


T-S+ 0 - os" oa 


P(S) = 


From (23), the statistics of M can be derived. For instance, the mean 
and the variance are 


E{M} = a Mpa 
=lim P’(S) 
S>1 


1 
= 5 age (24) 
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Var {M} = lim [P"(S) + P'S) — P2(S)] 


(1 — nw) (1 + 8p + 2B) 


ri (25) 


Higher-order statistics of M can be similarly obtained. At the threshold 
error rate, €0, u = 3, E{M} = 6. Thus, the expected alarm time is 
approximately 600 ms. The threshold variance is 22, which is quite 
large. 

Two standard methods are available to evaluate the probability 


coefficients pm, m = 1, 2, ---. The first one is 
sd NS) 
Pm = lim — m=0,1,2,°-°>. 


The second one is through partial fraction expansion of (23). Both 
methods require extremely tedious derivations. A simple alternative 
is presented in Appendix B which first expands the denominator of 
(23) as follows ; 


1 2 ; 
1—S+ 0 — ast = 2,08 oo 
with 
Co = Cy = Ca = 1, 
and 
C; = Ci-1 — (1 = u) uC 5-3 1 = oO (27) 
From (19), (23), (26), and (27) 
Pm = WC m2 = WC m—s m = 3. (28) 


Equations (27) and (28) provide an attractive way to evaluate the 
probability coefficients pm’s. What is more, pm can be obtained without 
first calculating pPm—1, DPm—2, etc. It is interesting to note that for any 


error rate, 
P2 = 
Ps = pa = (1 — uw)? 


Dm > Dm+i m2 4. 


Thus, the probability that the vmr will generate an alarm during the 
second measuring period is always the largest, regardless of the error 
rate. The probability decreases monotonically at later measuring 
periods. 
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The cumulative distribution function of M is 


Fu(m) = P{M < m} 


T 
M 
= 


Since the duration of each measuring period is not greater than 
to = 100 ms, the length of the timer, 


P{vmr has generated an alarm in mty ms} = D> px. 
k=0 


This equation can be used to plot the lower bound of the alarm prob- 
ability as a function of time. 


2.4 Distribution of violation measuring time 


The distribution of the measuring time T, assuming N = n and 
e = ¢, 1s considered next. If we let Y be the number of error bits prior 
to the nth error, then Y has the negative binomial distribution 


ay afl¥t®~!) ag — 
PLY = yj =( aap Jea 64. 
The time elapsed for the nth error to occur, 


_Y¥+n 
oar 


T 


has the probability-density function (PDF) 
= — vt ~~ 1 n a t—n 
PIT = =7(% 7) ea rin, (29) 


Equation (29) is the distribution of the discrete violation measuring 
time T given that N = n and e = e. The T4M vr has the additional 
condition T S t) = 100 ms; i.e., each measuring period is no greater 
than 100 ms. Let this censored random variable be denoted by T,. 
It is now desirable to find the distribution of T., given that N = n, 
e = «, and T, S ¢. Unfortunately, this task is difficult to perform in 
the discrete sample space. However, since each information bit is 
3.65 ns long while the T, of interest is in milliseconds, the discrete 
censored random variable can be considered continuous for ease of 
calculation. From the Poisson theorem, (11) can be approximated by 
the Poisson distribution 

PINS ap = en 7 (yel)” (30) 


n! 
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Let Y:, 7 = 1, 2, ---, m represent the time from the (7 — 1)th error 
to the zth error, then its pp¥F is given by 


fx(t) = yeew™', 
Through the use of the characteristic functions, it is easy to see that 
the sum 


has the gamma distribution 


frQ = i te OSt<o, 


By successive integration by parts, it can be shown that 
k 
[> feat = corn’ Se 
k=0 
Thus, the censored random variables T, has PDF 


Sr(t) + 5(t — to)e*r% > re gels 


lA 


fr() = (31) 


0 t Sty, 
where 6(é — fo) is the delta function. Its characteristic function T(w) is 
ig k 
Te = pes bate ( — e (€r—Jw) to PAY = jw) #) 
) (ey — jw)” : a = k! 
| + e-(ey~iw) to = (evto)* 
kao =k I 


The mean 7;, is given by 

dT .(w) 20 5 de esis k(eyto)"—* 
jdw juno €Y  €Y ka1 (n—k)! 
The variance o7 can be evaluated similarly. The first term on the 
right of (32) is the mean value of T. The second term is present be- 
cause of the additional restriction T S t. At the alarm-error-rate 
threshold, 71, & 97 ms, n/ey = 100 ms, the contribution of the second 


term is about 3 ms. 


(32) 





Nee = 


2.5 Distribution of the alarm time 

Let To represent the time it takes the vmr to generate an alarm at 
a given error rate. It is desired to find the ppr of To. Let T:, 7 = 1, 
2, -°:, M, represent the time from the (¢ — 1)th to the 7th measuring 
interval, neglecting the 3-ms waiting time. The ppr of T; is given in 
(31). The alarm time is then 

M 
To =>, Ts 


t=] 
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Note that T> is the sum of a random number of random variables.!% 
Through the use of conditional probability, since M and T,’s are 
independent, the ppr of the random sum has a compound distribution 


fr(t) = LX Pml® 0), 


where Pm is given in (28) and f(t) is the m-fold convolution of 
fr.(t) with itself. The characteristic function of To is 


Toe) = Y pulT.(w) (33) 


The right side of (33) is the Taylor expansion of P(S) obtained in 
(23) with S replaced by 7'.(w). Thus, 





To(w) = PLT .(w) J. (34) 
The mean and the variance of To are 
1 to we N te 
(35) 
o_ ttn ,, A-vw@ltate) , 
on = pe te fs Ntey 


where 7:, is given in (32). Equation (85) is used to plot Fig. 3 which 
shows the mean alarm time versus the error rate. It can be seen that 
the mean alarm time decreases very fast as the error rate increases. 
The total alarm probability after time ¢ is 


P{Tr St} = [ fa,(Dat 


X Pm | £2 (at. (36) 


™m 


2.6 Waiting time distribution 


In the above analyses, the 3-ms waiting intervals have not been 
taken into account. The waiting timer is triggered after each counter 
overflow. The distribution of the waiting periods is studied next. Let 
W be the number of times the waiting timer is triggered before a vMR 
alarm is generated, assuming that M = m. The last three measuring 
periods before a vmr alarm should be 011 [notations are defined 
before (15) ] and the waiting timer is definitely triggered once. Let 


A, = the event that 11 does not occur in / measuring intervals. 
This event occurs if the counter does not overflow in the first measuring 


interval, followed by the event A;,_1, or the counter overflows in the 
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Fig. 3—Expected alarm time vs error rate. 


first interval but not the second interval, followed by the event Aj_2. 
Let 
a= P{Aj}, 
then 
a, = w(l — ware + (1 — war L> 2. (37) 


The generating function A(S) is defined as 

A(S) = x as! 1283 1, 
Multiplying (37) by S' and summing from | = 2 to infinity, 
S ay! = w(1 — n)S? X aoS'* + (1 — w)S¥ arsS'*. (38) 


Since ao = ai = 1, (38) can be written as 


A(S) — S— 1 = w(1 — w)S?A(S) + (1 — w)STACS) — 1], 
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then 
1+ u8S 
dL pS pS? 


Therefore, each P{A,}, 2 = 2, can be obtained by the method indicated 
in Appendix B. Let X be the number of times the waiting timer is 
triggered in / measuring periods, given that the event A, is true, then 


P{X > j, Ai} 
P{Aiy 


A(S) = 


P{X = j/Ai} = (39) 


Let P{-|0} denote the conditional probability assuming the counter 
does not overflow in the first measuring interval, and P{-|1, 0} denote 
the conditional probability assuming the counter overflows in the first 
but not the second interval. The numerator of (39) can be written as 
P{X = j, Ai} = P{X = j, Ail 1} P(1} + P[X = 3, AiO} P {0} 
= pP{X = j,Ai]1} + (1 — »)P{X = j, Ail0}. (40) 
However, . 
P{X = j, Ail} = P{X = j, Ail, YP{1/1} 
+ P{X = j, Ail1, O}P{0|1} 
=0+ (l—4#)P{X= j —1, Ar} (41) 
P{X = j, A,|0} = P{X = j, Avs}. (42) 
Insert (41) and (42) into (40); then, 
P{X = j, Ai} =u — w)P{X = 7 — 1, Ars} 
+ (1 — »)P{X = j, Avi}. (43) 


pir = P{X = j, Ai}. 


Let 


Equation (43) can be written as 
Pit = WC — w)pj-1,-2 + (1 — 2) p;,1-1. 


Following the derivation of (38), we obtain 


o 


Sd pj aSiSh = w(L — w) S83 OY pjar-2S{-4Ss 
j=l 1=2 j=1 1=2 
+ (1 = w)S2 DY pj rrSiSy". (44) 
ik 
Define the bivariate generating function A (Sj, Sz) as 


A(S1, S2) = > > D;,S{S>. 
j=0 i=0 
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Since 


Pit = 90 ee oe eg 
po, = (1 — p)! l= 0,1, 2, 
P11 = H, 
eq. (44) can be reduced to 
1 
A(Si, S2) rs t= —.2)8; — pSyS2 = ul _ bw) SyS3A (Si, S») 
1 
+1 -)8:| As 8) - pag |; 
Therefore, 
A(S1, Se) = noes (45) 


1 — (1 — w)S. — w(l — 2)S1S3_ 


An iterative expression similar to that given in Appendix B can be 
obtained for the evaluation of p;,., and, hence, P{X = j| Az} according 
to (39). Note that P{X = j|A.} = Ofor j > 1/2 +1. 

An example is given below. When ¢ = €, » = 3, it was shown in 
(24) that on the average six measuring periods are required for the 
vmr to generate an alarm. During the last three periods (011), the 
waiting timer is triggered once. It is desirable to find the distribution 
of X in the first three periods. From (37) and (45), 

P{As} = 1 — 2p? + pi 
= (1 — n)8 
pis = 3u(1 — pu)? 
P23 = wl — H) 
P33 = 0 


i xX P(X = s|Ay} 


u(l — #)(3 — w). 
1 — 2y? + p 


Ss 
be 
l 


E{X|A3} 


At the threshold, u = 3, 

E{X|As} = 1. 
Thus, in the first three measuring intervals, the waiting timer is 
expected to be triggered once. In the last three intervals (011), the 
waiting timer is definitely triggered once. Hence, if the alarm occurs 
at the sixth measuring interval, then 


E{W|M = 6} = 2. (46) 
Equation (46) says when ¢« = ¢9, the waiting timer shall be, on the 


average, triggered twice before an alarm is generated. 
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2.7 Statistics of alarm release and oscillation 


To avoid oscillatory alarms near the error threshold, a release timer 
with duration d > ty is used to measure the violation rate when the 
VMR is in the alarm condition. The alarm is released only after the release 
timer times out and the counter does not overflow. Let » be the prob- 
ability of counter overflow during the measuring period d. From (12) 


Seo at (. ) 
Vyde(1 — 6) 
Let K represent the number of measuring periods before the vmr 
stops alarming; 1.e., the var will release the alarm at the (K + 1)th 
period. Then, 
hp = PK =k} 
= (1 — v)v* k = 0,1, 2,---. (47) 
Thus, K is governed by a geometric distribution with generating 
function 
1—p 
1— wW 
The distribution of the alarm-release time D (assuming the error 
rate remains constant) will be derived first. Let D; represent the time 
from the (¢ — 1)th to the 7th counter overflow during the alarm state. 
The distribution of D; is given by (29) and its generating function is 
eS1/7 





H(s) = (48) 


Dis) = TW Oa (49) 
The alarm-release time is again given by a random sum 
k 
D= > D;+ 4d, (50) 


7=0 
where by definition, D) = 0. Since K and the D,’s are independent, 
the generating function of D is 
e eStly kn 
D(s) = S4 = hy (ass) 
= S4H[D,(S) ]. (51) 
The ppr of D is the compound distribution 


— ~ y(« = d) Sxl kn _— ¥y (z—-d)—kn 
fo(a) = So hay ( TE OT) ed — resin, (62) 


The mean and the variance of D are 











v n 
aa eer ae (53) 
Bic! 2 i id <n 
a= 2 xgx(- 41-5 (54) 
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Fig. 4—Average first-oscillation time vs duration of release timer. 


The first oscillation time T,, i.e., the time it takes for an alarming 
vmRk to release and then generate another alarm, assuming the error 
rate remains constant, is 


T,.=D+To. (55) 
Its PpF is simply the convolution 
fra(t) = fo(t)*fro(t). 
The mean and the variance of T, are 
Nta = Na 1 Nto- (56) 
of = a4 + of. . (57) 
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Equation (56) is employed to plot Fig. 4 which shows the expected 
first oscillation time versus the duration of the release timer d with 
the error rate as the parameter. When « = 10~, if d = 330 ms, alarm 
oscillation is expected to occur once in 187 years; if d = 1 second, 
alarm oscillation is extremely unlikely to occur. 

The time from an initial alarm state to the 7th alarm oscillation is 
@ X T., whose distribution can be easily obtained from that of Ty. 


2.8 Pseudorandom signal switching statistics 


As described in 2.1, after the pseudorandom signal is switched in, 
if the vmR is back in-frame, immediately the violation counter is reset 
and starts counting again until the free-running release timer of 
duration d times out. Since the in-frame condition can occur anytime 
within the interval 0 to d, the time spent to count the violations is 
uniformly distributed between 0 and d. When the release timer times 
out, the number of violations counted is a mixture distribution ob- 
tained through randomization’ of the parameter ¢ in (30) 


P{N = ne = ¢} = ess 


ene (1 — gre = ak). (58) 


ved 1 


1 
x git 





The por f.(e) of the error rate « is usually unknown. If f.(e) is given 
or can be estimated empirically, (58) can be randomized by f;(e). 


P{N =n} =f P(N = nle = e}f.(dde, (59) 


where the upper integration limit is determined by the domain of e. 
From (59), the probability P{N <n} that the counter does not 
overflow, 1.e., the pseudorandom signal will be switched out, can be 
evaluated. 


2.9 Generalizations 


All the above derivations are general enough so that if one requires 
the counter to overflow consecutively more than twice (with the 
waiting timer triggered each time the counter overflows) before an 
alarm is generated, the results can be easily extended. For example, 
if the vmr generates an alarm after k consecutive counter overflows, 
then (24) becomes 
E{M} = t—# 

(1 — p)p®’ 
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and the probability coefficients are 


Dm = 0 m=0,1,---,k-1 
pr = p* 
Pktl = Deze = = Por = (1 — p)p*® 


Dn > Pmt me 2k+1 


Ill. DISCUSSION 


As discussed in the introduction, one of the main functions of the 
VMR is to generate alarms when it detects that the line performance is 
below a predetermined objective. However, the digital line performance 
objective is usually set in terms of a threshold bit-error rate which 
cannot be directly measured in service. Equation (8) establishes the 
relationship between the bit-error rate and the parity-violation rate 
for a digital line employing parity-checking digits. Figure 1 shows 
that for the parity-check structure used in the T4M system and for 
bit-error rates below 10-%, there is almost a one-to-one correspondence 
between a bit error and a parity violation. This implies that the parity- 
checking scheme is effective in determining digital transmission line 
performance. 

When the T4\I var parity violations exceed a specified threshold 
in two consecutive measuring intervals, an alarm is generated. This is 
normally followed by an automatic transfer of the failed line to a spare 
line if the latter is available. In general, each spare line will protect 
several service lines to reduce system cost. Thus, a so-called ‘‘hard’”’ 
alarm threshold, which clearly distinguishes between error rates slightly 
above and below the threshold, is desirable because it is unlikely to 
cause an alarm at error rates below the threshold. In this case, the 
spare line will be available to protect more serious failures on other 
service lines. It also takes less time for a vmR with a hard threshold to 
generate alarms when the error rates are above the threshold. Equation 
(14) gives the probability of the parity-violation counter overflow as 
a function of the error rate and the duration of the measuring interval. 
Figure 2 is a plot of (14) and exhibits the desirable hard threshold 
characteristics. As the error rate varies from 0.6 X 10~® to 2 X 10~, 
the probability of counter overflow changes from 0.0001 to 0.999. 

When a catastrophic failure occurs on a line, its vmr should generate 
an alarm as soon as possible so that an automatic transfer to a spare 
line can take place without trunk disconnection. When an error rate 
just above the threshold is detected, little harm will be done if the 
var takes longer to announce an alarm. Equation (35) obtains the 
mean alarm time as a function of the error rate. From Fig. 3 it can be 
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seen that the mean alarm time of the T4M vmr reduces very fast 
with increasing error rates. Note that if an error rate of 0.7 X 10-8 
(slightly below the threshold) persists for hours, eventually an alarm 
will be generated because the alarm threshold is not infinitely hard. 

The amount of hysteresis required in releasing an alarm is an 
important part of vmMr design. The release timer should be long enough 
so that oscillation between alarm request and alarm release is unlikely 
to occur. It should also be short enough so that alarms are not un- 
necessarily prolonged. Equation (56) gives the expected oscillation 
time as a sum of the mean alarm time and the mean release time, 
both of which are functions of the error rate and the length of the 
release timer. Figure 4 shows that when the duration of the release 
timer is greater than three times that of the parity-violation measuring 
interval, alarm oscillation is not likely to occur at any constant error 
rates. This is due to the fact that the mean alarm time is large for 
error rates below the alarm threshold while the mean release time is 
long for error rates above the alarm threshold. 


APPENDIX A 


The T1 Outstate (1.544 Mb/s) vmr counts 16 bipolar violations 
(violations occurring within a 0.3-ms interval are counted only once) 
in 85 ms to generate an alarm. The T2 (6.312 Mb/s) vmr generates a 
low-error alarm if it counts 32 bipolar violations in 5 seconds (violations 
occurring within a 3.2-us interval are counted only once). Since the 
error rates of interest are near the threshold, it can be assumed that 
no two violations occur ‘“‘close”’ to each other. The 3A-RDS (44.736 
Mb/s) vor generates an alarm if it counts 31 parity violations in 2 
seconds. These alarm rules are simpler than that for the T4M vr, 
hence, the alarm statistics of these vmRs are also easier to derive. For 
the vmR of each system, a probability of counter overflow u can be 
derived as in (14). This probability is also the probability of alarm. 
The three alarm rules have identical mathematical models; hence, 
no separate discussions are necessary. 

Let M represent the number of elapsed measuring periods before 
the vMR generates an alarm; i.e., the vmMr will generate an alarm at 
the (M + 1)th period. Then, 


Pm = P{M = m} 
= p(1 — »)” m= 0,52, 


M is governed by a geometric distribution. Most other statistics dis- 
cussed in Section II can be derived similarly. 
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Because of the simplicity of the geometric distribution, given a 
probability of alarm p, the number of elapsed periods k before the 
‘probability p is reached can be obtained explicitly, 


PSL Pm 
k 
= 2 wl —#)® 
=1- (1-4). 
Therefore, 
a In(l—p) _ 
~ In(1 — p) 


In each of the first k measuring intervals, the counter will not overflow 
when the measuring timer times out. Thus, the total alarm probability 
as a function of elapsed time can be plotted easily as opposed to 
evaluating (36) for the T4M vnr. 


APPENDIX B 


This appendix derives an iterative expression to calculate the prob- 
ability coefficients p,’s discussed in Section 2.3. Specifically, given that 


P(S) = ¥ pS! (60) 
and 
3 a;S? 
PS)S 3S (61) 


1+ > DS 
1=1 


it is desired to obtain the p,’s in terms of the a,’s and the b,’s. Let the 
denominator of (61) be expanded as follows 


1 _ Fas 
1+ > bs! *° 
i=0 
C;,7 = 0, ---, n — 1, can be determined through long division or by 


comparing the coefficients of the S‘’s in 


ee (1 +> v8*)( - } 


For 7 2 n, 


OCS BO: 
k=l 
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Thereforé 


P(S) = = ai XO C,S*. (62) 


1=0 


Compare (60) and (62), p:, 7 = 0, 1, ---, m — 1 can be determined 
easily. For 7 = m 


Pi = » anC ina 


A=0 


blir | 


iM 


Py me | - 


1 


Dd Gnd Cy—n—x- 
h=0 k=l 
p: can be calculated by computer without knowing p:-1, pi-s, ete. 
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digital network theory, and digital filter structures. He is presently 
engaged in research in speech communications and digital signal 
processing. Associate editor, JEEE G-ASSP Transactions. Member, 
Sigma Xi and IEEE G-ASSP Technical Committee on Digital Signal 
Processing. 


G. S. Fang, B.S.E.E. 1967, National Taiwan University ; PhD, 1971, 
Princeton University; Computer Sciences Corporation 1971-72; Bell 
Laboratories, 1972—. At Bell Laboratories, Mr. Fang has worked on 
high-speed digital transmission, protection switching, and micropro- 
cessor applications. 


James L. Flanagan, Se.D. (E.E.), 1955, Massachusetts Institute 
of Technology; Bell Laboratories, 1957—. Mr. Flanagan has worked 
in voice communications, acoustics, and digital techniques for signal 
coding and transmission. He is Head, Acoustics Research Department. 
Fellow, ILEEE; Fellow, Acoustical Society of America; Board of 
Governors, American Institute of Physics; member, Sigma Xi; Tau 
Beta Pi. 


Franz T. Geyling, B.S., 1950, M.S., 1951, and Ph.D., 1954, Stanford 
University; Bell Laboratories, 1954—. Mr. Geyling has engaged in 
solid mechanics research and experimental stress analysis. From 1959 
to 1970 he specialized in orbital mechanics and was responsible for 
tracking and control studies of missiles, satellites and spacecraft. He 
also contributed to the structural analysis of ground antennas and 
suspended underwater arrays. Since 1971, he has been engaged in 
continuum physics as applied to materials research. He is coauthor 
of a book and has served as associate editor of the SIAM and AIAA 
Journals, as well as chairman of the AIAA Astrodynamics Specialist 
Committee. Associate Fellow, AIAA; Member, APS, Rheol. Soc., 
ASME, SIAM, Phi Beta Kappa, Tau Beta Pi, Sigma Xi. 
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David J. Goodman, B.E.E., 1960, Rensselaer Polytechnic Institute ; 
M.E.E., 1962, New York University; Ph.D. (E.E.), 1967, Imperial 
College, London; Bell Laboratories,1967—. Mr. Goodman has studied 
various aspects of digital communications, including analog-to-digital 
conversion, digital signal processing, assessment of the quality of 
digitally coded speech, and error mechanisms in digital transmission 
lines. In 1974 and 1975, he was a Senior Research Fellow at Imperial 
College, London, England. Member, IEEE. 


Sing-Hsiung Lin, B.S.E.E., 1963, National Taiwan University ; 
M.S.E.E., 1966, and Ph.D., 1969, University of California, Berkeley ; 
Bell Laboratories, 1969—. At the Electronics Research Laboratory, 
University of California at Berkeley, Mr. Lin engaged in research 
on antennas in plasma media and numerical solutions of antenna prob- 
lems. At Bell Laboratories, Mr. Lin is working on wave propagation 
problems on terrestrial radio systems and earth-satellite radio systems. 
Member, IEEE, Sigma Xi, AIAA. 


Barbara J. McDermott, B.A. (Psychology), 1949, University of 
Michigan; M.A. (Psychology), 1963, Columbia University; Bell 
Laboratories, 1959—. Ms. McDermott has worked on speech quality 
evaluation and multidimensional scaling analysis. Member, Acoustical 
Society of America. 


Lloyd H. Nakatani, B.A. (Mathematics), 1964, Ph.D. (Psychology), 
1968, University of California; Bell Laboratories, 1968—. Mr. 
Nakatani has worked on speech quality evaluation and on speech 
perception and synthesis research. Member, Acoustical Society of 
America. 


Philip J. Rich, B.S., 1972, University of Illinois, M.S. (Physics), 
1974, Georgia Institute of Technology, Bell Laboratories, 1974—. 
Mr. Rich is currently engaged in studies related to the characterization 
and splicing of optical fibers. 


B. W. Stuck, 8.B.E.E., S.M.E.E., 1969, Se.D., 1972, Massachusetts 
Institute of Technology; Bell Laboratories, 1972—. Mr. Stuck has 
worked on problems in applied probability theory and mathematical 
physics. He is presently concerned with modeling and performance 
analysis of large digital systems. Member, SIAM, MAA, IEEE. 


Susan A. Webber, B.A. (Mathematics), 1972, Vassar College; Bell 


Laboratories, 1973—. Miss Webber is engaged in scientific program- 
ming for laboratory computer systems dedicated to speech research. 
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Abstracts of Papers by Bell System Authors 
Published in Other Journals 


CHEMISTRY 


The Anodic Behavior of Gold in Sulfuric Acid Solutions Effect of Chloride and 
Electrode Potential. R. P. Frankenthal and D. E. Thompson, J. Electrochem. Soc., 
123 (June 1976), pp. 799-804. Gold corrosion in the passive and transpassive 
potential regions 1s affected by Cl-. Soluble Au(III) and Au(OH); film are the 
reaction products. Chloride promotes dissolution and restricts film formation. 
Mechanism of O2-evolution is different on a filmed than on a film-free surface. 


Brillouin Scattering from Polymer Films. G. D. Patterson, J. Polym. Sci., Part A-2, 
Polym. Phys., 14 (1976), pp. 143-149. Many polymers cannot be prepared as 
clear amorphous blocks suitable for classical light-scattering studies. However, most 
linear polymers can be prepared as films which are somewhat transparent. With the 
advent of high-contrast multipass interferometers, these films can now be studied 
by Brillouin scattering. This work demonstrates the wide range of polymeric ma- 
terials that can now be studied by Brillouin spectroscopy. 


Direct Measurement of Spontaneous Predissociation Using Coaxial Laser-Molecular 
Beams. R. M. Lum and K. B. McAfee, Jr., J. Chem. Phys., 63, No. 11 (December 
1975), pp. 5029-5033. A laser-molecular beam technique has been devised to 
enable direct observation of radiationless transitions of isolated single molecular 
states. Spontaneous predissociation, detected as a modulation of the molecular beam, 
has been observed in Brz at laser wavelengths which produce selective excitation of 
the individual Br: isotopic species. 


Neutron Scattering Study at High Pressure of the Structural Phase Transition in 
Paratellurite. D. B. McWhan, R. J. Birgeneau, W. A. Bonner, H. Taub,* and J. D. 
Axe,* J. of Phys. C. Lett, 8 (1975), pp. L81-L85. The dispersion relation for the 
transverse acoustic phonon mode propagating along (110) polarized along (110) in 
TeOQ, has been measured at P = 1 atm, P. (9.6 kbar), and 2P, where P, is the 
tetragonal-to-orthorhombic structural transition pressure. Measurements of the order 
parameter are consistent with a mean field theory of a pressure induced elastic 
instability. “Brookhaven National Laboratory. 


COMPUTING 


Integrated Injection Logic: A Bipolar LSI Technique. R. A. Pedersen, Computer, 9, 
No. 2 (February 1976), pp. 24-29. Integrated Injection Logic (I?L) is a novel 
bipolar circuit design approach to achieve high-density large-scale integration. As 
the basic logic unit, it uses multicollector npn transistors which are powered from 
merged multicollector lateral pnp transistors. ?7L can be fabricated with standard 
buried collector technology and is therefore compatible with conventional bipolar 
circuitry—digital or linear—on the same silicon chip. 


ELECTRICAL AND ELECTRONIC ENGINEERING 


Behavior of Tandem Buffers with Geometric Input and Markovian Output. J. Hsu 
and P. J. Burke, IEEE Trans. Commun., COM-24 (March 1976), pp. 358-361. A 
discrete-time system of infinite-capacity buffers in tandem is studied. Input to the 
first buffer is geometric and the output for all but the last buffer (which can be 
arbitrary) is Markovian. The analysis shows that, in equilibrium, each buffer can be 
analyzed separately and independently. 


Chemisorption and Schottky Barrier Formation of Ga on Si(111)7 X 7. G. Margari- 
tondo, 8S. B. Christman, and J. E. Rowe, J. Vacuum Sci. Technol., 73 (January— 
February 1976), pp. 329-332. The chemisorption of gallium atoms on $i(111)7 XK 7 
was studied using photoemission, electron-energy-loss, LEED and Auger spectros- 
copy. Most of the states formed at the metal-semiconductor interface are due to the 
first 1-2 metal monolayers and need a microscopic-atomic bonding or surface-band- 
structure theoretical description. 
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A Comparison of Chemical Etches for Revealing (100) Silicon Crystal Defects. 
D. G. Schimmel, J. Electrochem. Soc., 123, No. 5 (May 1976), pp. 734-741. Etch 
pit results of the Sirtl, Dash, Secco, and an experimental etch are compared for 
(100) silicon wafers after various device processing steps. Factors influencing etch- 
pit formation on silicon surfaces are discussed. A recommendation is given for the 
etch formation with the best etch-pit development. 


Fabrication and Performance of Offset-Mask Charge-Coupled Devices. A.M. 
Mohsen and T. F. Retajezyk, Jr., IEEE J. Solid State Circuits, SC-11 (February 
1976), pp. 180-188. The use of the offset-mask technique to fabricate two-phase 
and uniphase charge-coupled device (CCD) electrode structures is described. A 
new two-phase electrode structure with polysilicon-electrodes and self-aligned gates 
for the peripheral circuits has been developed. The polysilicon offset-mask electrode 
structure is very attractive for charge-coupled memories. Compared to other two- 
polysilicon level CCD structures, it has a higher packing density, is more tolerant 
to intralevel shorts, and does not require large numbers of small contact windows 
to connect the gate electrodes to the phase bus lines. 


High Repetition-Rate and Quasi-CW Operation of a Waveguide CO. TE Laser. 
P, W. Smith, C. R. Adams, P. J. Maloney, and O. R. Wood II, Opt. Commun., 16, 
No. 1 (January 1976), pp. 50-53. We report operation of a waveguide CO»: TH 
laser at excitation pulse repetition frequencies as high as 40 kHz. Quasi-continuous 
laser output was obtained yielding an average output power of 1.5 W from an active 
volume of 0.1 cc. Details of laser construction and excitation circuitry are given. 


Hydrogen Embrittlement of Electroless Copper Deposits. Y. Okinaka and S. 
Nakahara, J. Electrochem. Soc., 123 (1976), pp. 475-478. Electroless copper 
deposits were investigated for ductility, impurity content, void distribution, and 
fracture surface morphology. It is concluded that the brittleness of electroless copper 
deposits is due to the internal hydrogen embrittlement rather than the incorporation 
of cuprous oxide or morphological effects. 


Multicomponent Photopolymer Systems for Volume Phase Holograms and Grating 
Devices. W. J. Tomlinson, E. A. Chandross, H. P. Weber, and G. D. Aumiller, 
Appl. Opt., 16, No. 2 (February 1976), pp. 534-541. Novel photopolymer 
systems for the fabrication of high-resolution volume phase holograms and gratings 
devices are reported. We use a mixture of components chosen to have differing 
reactivities and polarizabilities. The image-forming exposure results, ultimately, in a 
modulated chemical composition. Peak-to-peak refractive index differentials of up 
to 1.5 percent were achieved. 


Multipoint Private Line Access Delays Under Several Interstation Disciplines. 
C. D. Pack and B. A. Whitaker, IEEE Trans. Commun., COM-24 (March 1976), 
pp. 339-348. Performance objectives for some types of computer-communications 
networks are stated in terms of access-delay statistics which measure the grade of 
service experienced by stations bidding for access to a multipoint private line (MPL). 
Using simulation and some analysis, we examine the access delay statistics for an 
MPL under each of four service disciplines. 


Observations on the Influence of Processing Steps on the Magnetic Hysteresis 
Parameters of a Co/Fe/Nb Alloy. M. R. Pinnel, IEEE Trans. Magn., MAG-12 
(May 1976), pp. 236-243. The variation of both magnetic and mechanical 
properties and microstructure of a Co/Fe/Nb alloy (Nibcolloy) with changes in 
processing has been characterized. Results indicate the use of a softening anneal 
prior to the final aging anneal can alter magnetic properties. A nonstability of 
coercivity to subsequent brief elevated temperature exposures of around 1000°C 
was also observed. 


Profile Parameters of Implanted-Diffused Arsenic Layers in Silicon. R. B. Fair and 
J. C. C. Tsai, J. Electrochem. Soc., 123 (April 1976), pp. 583-586. Equations 
have been derived that describe the important variables that are required to charac- 
terize the diffusion of As-implanted layers for the surface doping concentration range 
Cro = 1 X 10!° cm=*, In addition, data obtained from differential conductivity 
profile measurements and SIMS profile measurements have been used to obtain 
experimental parameters for these equations. 
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Qualitative Observations on the Diffusion of Copper and Gold Through a Nickel 
Barrier. M. R. Pinnel and J. E. Bennett, Met. Trans., 7A (May 1976), pp. 629-635. 
The interdiffusion behavior in planar-layered couples of Cu/Ni/Au at temperatures 
between 150 and 750°C have been characterized. Results demonstrate that the 
nickel barrier layer retards but does not block the transport of copper to the gold 
surface. Possible mechanisms for the anomalous buildup of copper at the gold/nickel 
interface and gold at the copper/nickel interface are discussed. 


Signaling and Switching As We Enter the Second Century. J. S. Ryan, Telecommun. 
J., 43, No. 111 (March 1976), pp. 206-219. During the first century of the tele- 
phone, switching evolved from the 21-line New Haven switchboard to the 104,000- 
trunk No. 4 ESS, and signaling from voice alerting to CCIS. This centennial issue 
article traces the history and reviews the status of signaling and switching as we 
enter the second century. 


A Study of Deep Levels in GaAs by Capacitance Spectroscopy. D. V. Lang and R. A. 
Logan, J. Electron. Mater., 4, No. 5 (1975), pp. 1053-1066. We show how the 
DLTS capacitance spectroscopy technique can be used to detect small amounts of 
deep-level impurities in GaAs p-n junctions. The DLTS spectra associated with Cu, 
Fe, Cr, O, and two unidentified, but commonly occurring, deep levels in GaAs are 
shown. The LPE distribution coefficients are obtained for Cu, Fe, and Cr. The 
carrier capture cross sections for six levels are measured and give evidence for capture 
by multiphonon emission. 


MATERIALS SCIENCE 


Detection of Catalytic Oscillations by Differential Thermal Analysis. P. K. Gallagher 
and D. W. Johnson, Jr., Thermochim. Acta, 16 (May 1976), pp. 238-240. Oscil- 
lations in DTA curves associated with the oxidation of CO using Pt containing 
catalysts were observed. This offers a relatively quick and simple technique for 
studying instabilities that arise from the interaction of the catalytic mechanism 
and the exothermic nature of the reaction. 


Kinetics of Formation of LiFeO. from 2Li,CO;-Fe,0; Mixture. P. K. Gallagher 
and D. W. Johnson, Jr., J. Amer. Ceram. Soc., 69 (March-April 1976), pp. 171-172. 
Mixtures of 2Li2CO3;-Fe:O3 were found to form LiFeO: at <600°C. Excess Li:COs; 
did not react until higher temperatures. Isothermal and dynamic kinetic studies gave 
an activation energy of 42-50 kcal/mole. A model involving the rapid surface diffusion 
of LizCO; is proposed. 


Diffusion Kinetics of Au Through Pt Films About 2000 and 6000 A Thick Studied 
with Auger Spectrscopy. C. C. Chang and G. Quintana, Thin Solid Films, 31 
(1976), pp. 265-273. Pt-Au couples with 2000A and 6000A Pt films were 
heat treated between 250° and 350°C in 1 atm No». Au was found to diffuse initially 


through Pt films <6000A by grain boundary migration and more than 10% atoms 
cem~? of Au crossed the Pt when the bulk of the Pt contained little Au (S1 at.%). 


For 2250 A Pt on Au, the time for half-saturation of the Pt surface with Au was 
t(0.5) = 1.2 X 1077 X exp (0.96 eV/kT) min. 


Domain Wall Image Contrast in the SEM. D. C. Joy, H. J. Leamy, S. D. Ferris, 
D. E. Newberry,* and H. Yakowitz,* Appl. Phys. Lett., 28 (April 15, 1976), pp. 
466-468. Contrast from domain walls in materials with cubic magnetic anisotropy 
has been observed in scanning electron microscope images. This contrast, which is 
visible in both the backscattered and absorbed current images, arises from the 
interaction of the convergent incident electron beam with the domains on either side 
of the wall. * National Bureau of Standards. 


Ultranarrow, Forbidden, Singlet-Triplet Anticrossings in H,. T. A. Miller and R. S. 
Freund, J. Chem. Phys., 63 (1975), pp. 256-263. Forbidden _singlet-triplet 
anticrossings have been observed between different Zeeman sublevels of thei (8d) *Ig, 
v=1, N = 6 and W(?) '2,t, v = 1, N = 4 states of He. The anticrossings are 
quite sharp and hence allow accurate determinations of the states’ zero field separa-~ 
Hoe linear and quadratic Zeeman parameters, coupling perturbation, and radiative 
ifetimes. 
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PHYSICS 


The Determination of Energy-Level Shifts Which Accompany Chemisorption. 
Homer D. Hagstrum, Surface Sci., 64 (1976), pp. 197-209. This paper discusses 
the ionization limit with respect to which the electronic energy levels of an adsorbed 
complex on a solid surface should be referenced, and how this limit is defined and 
measured. This leads to a reasonable procedure for determining energy-level shifts, 
at least for simple systems. 


The Linear Electric Field Effect for Low Spin Ferric Heme Compounds. W. B. 
Mims and J. Peisach*, J. Chem. Phys., 64, No. 3 (February 1, 1976), pp. 1074-1091. 
Measurements were made on the linear electric field induced g shifts for a series of 
low-spin ferric heme compounds by the electron spin echo method on noncrystalline 
samples at liquid helium temperatures. Some of these samples consisted of proteins 
and protein derivatives; others were made from heme reacted with small ligand 
molecules. Analytic expressions relating the observed changes in spin echo amplitude 
to certain of the g?-shift coefficients b;; were derived. Results suggest the presence 
of extensive back bonding between the Fe*+ ion and the axial ligands of the heme 
complex. Results also suggest that myoglobin hydroxide is characterized by a crystal 
field which is exceptionally low for the S = (1/2) ferric heme group of compounds. 


“Departments of Pharmacology and Molecular Biology, Albert Einstein College 
of Medicine of Yeshiva University. 


Elastic Constants of bcc ‘He. D. S. Greywall, Phys. Rev. B (Nucl. Phys.), 13 (Feb- 
ruary 1976), pp. 1056-1068. Longitudinal and transverse sound velocities were 
measured in single crystals of bec ‘He with known orientation at 21.00 cm?/mole 
and at 1.612 K. The temperature dependence of sound velocities along an isochore 
and along the melting curve was measured for several samples. No premelting effects 
were observed. The ratios of bcc *He to ‘He elastic moduli at the same molar volume 
are considerably larger than the classical ratio of unity but in excellent agreement 
with the quantum-mechanical calculations of Horner. Existing calorimetric data 
are compared with the present determination of the Debye temperature. 


Isotope Abundances in Interstellar Molecular Clouds. P. G. Wannier, A. A. Penzias, 
R. A. Linke, and R. W. Wilson, Astrophys. J., 204, No. 1 (February 15, 1976), 
pp. 26-42. We use the J = 1—J = 0 transitions of }#C'*O, C60, and C180 
at ~ 110 GHz to measure abundance ratios of carbon and oxygen isotopes throughout 
our galaxy. The measured values of [!2C]/[%C] seem to be different than the 
prtelgien values, suggesting significant galactic chemical evaluation since the birth 
of the sun. 
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